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Surface-located Streptococcus pneumoniae polypeptides 

All patent and non-patent references cited in this application are hereby incorporated 
by reference in their entirety. This patent application claims the benefit of priority from 
5 U.S. Provisional Application Serial No. 60/653,932, filed February 18, 2005, which is 
incorporated herein by reference in its entirety. 

Field of the invention 

The present invention relates to cell-surface-located polypeptides of Streptococcus 
10 pneumoniae and their use in immunisation against Streptococcal infection, in diagno- 
sis of Streptococcus and in identification of compounds with anti-Streptococcus activ- 
ity. 

Background of the invention 

15 Occurrence of Streptococcus infections 

Sternberg and Pasteur were the first to identify Streptococcus pneumoniae, initially 
described as the pneumococcus (Austrian R. The pneumococcus at the millennium: 
not down, not out. J Infect Dis 1999;179 (Suppl 2):S338-41). Streptococcus pneumo- 
niae is a Gram-positive encapsulated coccus. Based on differences in the composi- 

20 tion of the polysaccharide capsule, about 90 serotypes are identified. This capsule is 
an essential virulence factor. The majority of pneumococcal disease in infants is as- 
sociated with a small number of these serotypes, which may vary by region. Current 
data suggest that the 1 1 most common serotypes cause at least 75% of invasive dis- 
ease in all regions. 

25 

Streptococcus pneumoniae is a human pathogen. The reservoir for pneumococci is 
presumably the nasopharynx of asymptomatic human carriers. There is no animal or 
insect vector. Streptococcus pneumoniae is the most common cause of bacteraemia, 
pneumonia, meningitis and otitis media in young children. Pneumococcal disease is a 

30 very serious illness in young children. In the United States it is estimated that Strepto- 
coccus pneumoniae cause 200 deaths, 700 cases of meningitis, 17,000 cases of bac- 
teraemia, 4.9 million cases of otitis media (ear infections) annually in children under 5 
years of age. In Europe and the United States, pneumococcal pneumonia is the most 
common community-acquired bacterial pneumonia, estimated to affect approximately 

35 100 per 100 000 adults each year. The corresponding figures for febrile bacteraemia 
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and meningitis are 15-19 per 100,000 and 1-2 per 100,000, respectively. The risk for 
one or more of these manifestations is much higher in infants and elderly people. 

Meningitis is the most severe type of pneumococcal disease. Of children under 5 
5 years with pneumococcal meningitis, about 5% will die of their infection and others 
may have long-term problems such as hearing loss. Many children with pneumococ- 
cal pneumonia or blood stream infections will be ill enough to be hospitalized; about 
1% of children with blood stream infections or pneumonia with a blood stream infec- 
tion will die of their illness. Nearly all children with ear infections recover, although 
10 children with recurrent infections can suffer hearing loss. 



At serious risk are also patients taking immunosuppressive chemotherapy, those 
with congenital and acquired immune deficiency (including HIV infections) and those 
with chronic renal disease. Table. 1: The major disease indications and the number 
15 of hospitalised patients as well as case fatality rates in children and the elderly, 
which occur per annum in the US: 



Disease indication 


Pneumococcal 
pneumonia 


Pneumococcal 
bacteraemia 


Pneumococcal 
meningitis 


Hospitalised patients/ 
annum in the US 


175.000 


50.000 


3.000-6.000 


Case fatality rate 
children/elderly 


5-7%/higher 


20%/60% 


30%/80% 



Tab!© 2, incidence case-fatality ratio, projected U.S. cases and deaths, and proportion nonsuseeptible to penicillin of 
invasive disease Identified in the Active Bacterial Core surveillance (ABCs'3, 1998 





Group A 


Group B 


Haemophilus 


Neisseria 


Streptococcus 




Streptococcus 


Streptococcus 


influenzae 


nwn tngitidig 


pneumoniae 


Aggregate incidence* 1 




6.5 


1.4 


X j. 0 


24,1 


Range by area a 




<L8 - 8,5 


1*1 - 2,3 


0.6 - 2.0 


20.0-28.9 


Case- fatality ratio 




9,15 % 


13,9% 


13.7% 




in A'BCs areas 












Projected U,S. cases 


10,200 


17.400 


3,900 


2,500 




Projected U.S. deaths 


1,300 


1.700 


500 


400 


6.100 


Penicillin nonsuscepiibilitv b 0 


0 




1.1ft 


25.0S& 



incidence = eases per 100,000, 

b Nonsruseeptible includes isolates classified as either intermediate or resistant to penicillin. Results reflect testing of group A 
;;trefptr>eocca] isolates from 1997 Cn=lB3) and group B streptococcal isolates from 1997 and combined faslSS). 



Schuchat, A et al. "Active Bacterial Core Surveillance of the Emerging Infections. 
20 Program Network", Emerging Infectious Diseases, Vol. 7, No.1, Jan-Feb 2001. 
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Symptoms of Streptococcus pneumoniae infections 

Pneumococcal pneumonia is the most common clinical presentation of pneumococcal 
5 disease among adults. The incubation period of pneumococcal pneumonia is short, 
about 1 to 3 days. Symptoms generally include an abrupt onset of fever and chills or 
rigors. Typically there is a single rigor, and repeated shaking chills are uncommon. 
Other common symptoms include pleuritic chest pain, cough productive of mucopuru- 
lent, rusty sputum, dyspnea (shortness of breath), tachypnea (rapid breathing), hy- 
10 poxia (poor oxygenation), tachycardia (rapid heart rate), malaise, and weakness. 

Treatment and prevention of Streptococcus pneumoniae infections 
The emerging resistance to penicillin and other commonly used antibiotics under- 
scores the importance of the development of novel strategies to combat pneumococ- 
15 cal disease. In some areas of the U.S. up to 40% of invasive pneumococcal isolates 
are resistant to penicillin. Treatment will usually include a broad spectrum cepha- 
losporin, and often vancomycin, until results of antibiotic sensitivity testing are avail- 
able. 

There are two vaccines against Streptococcus pneumoniae available on the market: 

1. Prevnar® (Wyeth), a 7-valent pneumococcal conjugate vaccine, containing 
polysaccharides of serotype 4, 6B, 9V, 14, 18C, 19F and 23F. 

2. Pneumovax® (Merck Research Laboratories), a 23-valent polysaccharide 
vaccine containing 23 purified capsular polysaccharide antigens (serotypes 1 , 
2, 3, 4, 5, 6B, 7F, 8, 9N, 10A, 11A, 12F, 14, 15B, 17F, 18C, 19A, 19F, 20, 
22F, 23F, and 33F). 

However, there is still a large medical need for development of improved Streptococ- 
cus vaccines, because: 
30 

o These vaccines only cover certain serotypes, e.g. Prevnar® has a potential 
coverage of over 85% of the pneumococcal isolates for the USA, 60-70% for 
Europe and around 55% for Asia. 



20 



25 
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o Children under 2 years of age, who suffer the highest rates of pneumococcal 
carriage and disease, and immunocompromised patients show a severely im- 
paired antibody response upon this vaccination. 

o The polysaccharide vaccines are not effective against acute otitis media 
caused by Streptococcus pneumoniae. 

o The polysaccharide vaccines do not induce a T-cell-dependent immune re- 
sponse. This implicates the absence of memory B cells and limits the period of 
protection. 

o Several of the capsule polysaccharides are poorly immunogenic. These in- 
clude several serotypes associated with penicillin resistance. 

Currently, several pneumococcal surface proteins are considered as alternative vac- 
cine candidates because of their serotype-independence. However, so far, none of 
the proteins are considered to elicit species-wide pneumococcal protection. This can 
be explained by the occurrence of allelic variation within most individual proteins. An- 
tibodies raised against a single protein may not recognize allelic variants. Efficacy 
against pneumonia is an important factor in deciding on the use of new vaccines in 
developing countries. 

In addition to better ways of treatment and prevention, there is a need for novel rapid 
and reliable methods for diagnosis of Streptococcus pneumoniae infections. 

The above objectives can be accomplished through the identification and use of suit- 
able Streptococcus pneumoniae polypeptides that can function as targets, i.e. targets 
for the immune system and/or for antibodies, targets for cytotoxic inhibitors, or targets 
for indicator moieties in diagnosis. 

Summary of the invention 

The present application relates to surface-located polypeptides of Streptococcus 
pneumoniae. In the context of this application, a 'surface-located' polypeptide is de- 
fined as a polypeptide which is at least partially (i.e. part of the polypeptide chain 
and/or part of the population of polypeptide molecules) localised outside the mem- 
brane of a Streptococcus pneumoniae cell. Thus, a surface-located polypeptide is a 
polypeptide which is fully or partially exposed to the space outside the membrane. 
Surface-located polypeptides furthermore include all polypeptides or polypeptide 
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fragments that can be identified in fractions obtained by high-pH surface-protein ex- 
traction or mutanolysin digestion as described herein. 

Surface-located polypeptides are attractive targets for antibacterial therapy 
and/or diagnosis of bacterial infection, since the exposure of such polypeptides to the 
5 extracellular space means that compounds that interact with these polypeptides (e.g. 
compounds used to prevent, treat or diagnose bacterial infections) often do not need 
to enter or pass the membrane to be effective. 

The determination of cell-surface localisation of a Streptococcus pneumoniae 
polypeptide can at present only be done experimentally and not by bioinformatics, as 

10 no common sorting signals or motifs are known for this localisation. It is possible to 
predict with some degree of certainty whether or not polypeptides enter the perip- 
lasm, but no general motif has been identified for surface-localisation of polypeptide. 
Prior art strategies for the identification of candidates for protein vaccination against 
Streptococcus pneumoniae have mainly been based on genome sequencing and in 

15 silico analysis (WO 02/077021; Wizemann et al. (2001) Infect. Immun. 69:1593- 
1598). These strategies have not been very successful, as only a small subpopulation 
of the candidates identified and tested conferred protection in a mouse model (Wize- 
mann et al. (2001) Infect. Immun. 69:1593-1598). 

The inventors have identified 282 different polypeptides in cell-surface frac- 

20 tions of Streptococcus pneumoniae. The method that was employed identifies poly- 
peptides that are expressed at a relatively high level. The combination of being sur- 
face-exposed and being present in relatively high amounts makes these polypeptides 
highly suitable as targets for antibodies and thus for use in passive or active immuni- 
sation/vaccination. 

25 

Accordingly, in a first aspect, the invention relates to a composition comprising 

- a polypeptide which comprises a sequence selected from the group consisting of 
surface-located Streptococcus polypeptides of SEQ ID NO:1-282, or comprises 
an antigenic fragment or variant of said sequence, 

30 or 

- a polynucleotide comprising a sequence encoding said polypeptide, 
or 

an expression vector comprising a sequence encoding said polypeptide, 
or 
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- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 
or 

an antibody capable of binding said polypeptide, 
5 for use as a medicament. 



In a preferred embodiment, said composition comprises 

- a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, and SEQ ID NO:28, or comprises 

10 an antigenic fragment or variant of said sequence, 

or 

a polynucleotide comprising a sequence encoding said polypeptide, 
or 

an expression vector comprising a sequence encoding said polypeptide, 
15 or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

In an even more preferred embodiment, said composition comprises 

- a polypeptide which comprises SEQ ID NO:16, or comprises an antigenic 
fragment or variant of SEQ ID NO: 16, 
or 

a polynucleotide comprising a sequence encoding said polypeptide, 
or 

an expression vector comprising a sequence encoding said polypeptide, 
or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

30 SEQ ID NO:16 represents a homolog of lipoate-protein ligase A, an enzyme which 
has previously been identified and characterised in E. coli and L. monocytogenes 
(Morris et al. (1994) J. Biol. Chem. 269:16091; O'Riordan et al. (2003) Science 
302:462). Proteins of this family have not previously been identified on the cell sur- 
face or found to be vaccine candidates or suitable targets for antibody therapy. 

35 



20 



25 
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In another preferred embodiment, said composition comprises an antibody capable of 
binding a polypeptide selected from the group consisting of SEQ ID NO:16, SEQ ID 
NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID 
NO:33. In an even more preferred embodiment, said polypeptide is SEQ ID NO:16. 

5 

In a further main aspect, the invention relates to the use of a composition comprising 
a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO: 1-282, or comprises an antigenic fragment or variant of said 
sequence, 

10 a polynucleotide comprising a sequence encoding said polypeptide, 

an expression vector comprising a sequence encoding said polypeptide, or 
- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

for the preparation of a medicament for the immunisation of an animal or human 
15 being against bacteria, preferably Streptococcus, more preferably Streptococcus 
pneumoniae, infections. Preferred sequences are SEQ ID NO:16, SEQ ID NO:10, 
SEQ ID NO:13, and SEQ ID NO:28. Most preferred SEQ ID NO:16. 

In a further main aspect, the invention relates to an antibody capable of binding a 
20 polypeptide selected from the group consisting of SEQ ID NO: 1-282. 

Furthermore, the invention relates, in a main aspect, to the use of an antibody 
capable of binding a polypeptide selected from the group consisting of SEQ ID NO:1- 
282 for the manufacture of a medicament for the treatment or prevention of 
25 Streptococcus, preferably Streptococcus pneumoniae, infections in an animal or 
human being. The use of antibodies capable of binding a polypeptide selected from 
the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID 
NO:20, SEQ ID NO:26, SEQ ID NO:28 or SEQ ID NO:33 is preferred. 

30 Most preferred is the use of an antibody capable of binding the polypeptide of SEQ ID 
NO:16 for the manufacture of a medicament for the treatment or prevention of 
Streptococcus, preferably Streptococcus pneumoniae, infections in an animal or 
human being. 
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The combination of being surface-exposed and being present in relatively high 
amounts also makes the polypeptides identified by the inventors highly suitable as 
targets for diagnosis of Streptococcus pneumoniae infection, allowing detection of 
intact cells with high sensitivity. Thus, in a further main aspect, the invention relates to 
5 methods for detecting Streptococcus pneumoniae or parts thereof, using indicator 
moieties capable of recognising the cell-surface located polypeptides described 
herein. 

In addition, the surface-localisation of the polypeptides makes them suitable as tar- 
10 gets for inhibitors. Such inhibitors may be bactericidal or bacteristatic or prevent inter- 
action of Streptococcus pneumoniae with the host organism (virulence). Thus, in a 
further main aspect, the invention relates to methods for identifying inhibitors of the 
cell-surface located polypeptides described herein. 



15 Definitions 

- Vaccine - is used to indicate a composition capable of inducing a protective im- 
mune response against a microorganism in a human being or animal. 

_ Protective immune response - is used to indicate an immune response (hu- 
moral/antibody and/or cellular) inducing memory in an organism, resulting in the 
20 infectious agent, herein Streptococcus pneumoniae, being met by a secondary 

rather than a primary response, thus reducing its impact on the host organism. 

- Polypeptide - unless specified otherwise, the term 'polypeptide' when used herein 
can also refer to a variant or fragment of a polypeptide. Preferred polypeptides are 
antigenic polypeptides. 

25 - Fragment - is used to indicate a non-full length part of a polypeptide. Thus, a 

fragment is itself also a polypeptide. 

- Variant - a Variant' of a given reference polypeptide refers to a polypeptide that 
displays a certain degree of sequence identity to said reference polypeptide, but is 
not identical to said reference polypeptide. 

30 - Antigen / antigenic / antigenicity / immunogen / immunogenic / immunogenicity - 

all refer to the capability of inducing an immune response. 

- Immunogenic carrier - refers to a compound which directly or indirectly assists or 
strengthens an immune response. 

- Expression vector - refers to a, preferably recombinant, plasmid or phage or virus 
35 to be used in production of a polypeptide from a polynucleotide sequence. An ex- 
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pression vector comprises an expression construct, comprising an assembly of (1) 
a genetic element or elements having a regulatory role in gene expression, for 
example, promoters or enhancers, (2) a structural or coding sequence which is 
transcribed into mRNA and translated into protein, and which is operably linked to 
5 the elements of (1); and (3) appropriate transcription initiation and termination se- 

quences. 

Binding partner - of a polypeptide refers to a molecule that can bind to said poly- 
peptide. Such binding can be indirect, through another molecule, but is preferably 
direct. A binding partner can be any type of molecule, such as e.g. small hydro- 
10 phobic molecules or e.g. a cellular or extracellular macromolecule, such as a pro- 

tein, a carbohydrate or a nucleic acid. Preferred types of binding partners include 
antibodies, ligands or inhibitors. 

Plurality - the term 'plurality' indicates more than one, preferably more than 10. 
Indicator moiety - the term 'indicator moiety' covers a molecule or a complex of 

15 molecules that is capable of specifically binding a given polypeptide and/or cell, 

and is capable of generating a detectable signal. Preferably, the indicator moiety 
is an antibody or comprises an antibody molecule. Thus, a preferred indicator 
moiety is an antibody coupled to or in complex with a detectable substance. 
Host-derived molecule or host molecule - refers to a molecule which is normally 

20 found in a host organism that can be infected with Streptococcus pneumoniae. A 

host-derived molecule is preferably a host polypeptide, preferably a human poly- 
peptide. 

- Antibody - the term 'antibodies' when used herein is intended to cover antibodies 
as well as functional equivalents thereof. Thus, this includes polyclonal antibodies, 

25 monoclonal antibodies (mAbs), humanised, human or chimeric antibodies, single- 

chain antibodies, and also binding fragments of antibodies, such as, but not lim- 
ited to, Fab fragments, F(ab') 2 fragments, fragments produced by a Fab expres- 
sion library, anti-idiotypic antibodies, hybrids comprising antibody fragments, and 
epitope-binding fragments of any of the these. The term also includes multivalent, 

30 multispecific, such as bispecific antibodies and mixtures of monoclonal antibodies. 

Dissociation constant, Kd, is a measure to describe the strength of binding (or 
affinity or avidity) between macromolecules, for example an antibody and its anti- 
gen. The smaller Kd the stronger binding. 

Isolated - used in connection with polypeptides, polynucleotides and antibodies 
35 disclosed herein 'isolated' refers to these having been identified and separated 
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and/or recovered from a component of their natural, typically cellular, environ- 
ment. Contaminant components of the natural environment are materials that 
would typically interfere with diagnostic or therapeutic uses for the polypeptide, 
and may include enzymes, hormones, and other proteinaceous or non- 
5 proteinaceous solutes. Polypeptides, polynucleotides and antibodies of the inven- 

tion are preferably isolated, and vaccines and other compositions of the invention 
preferably comprise isolated polypeptides or isolated polynucleotides or isolated 
antibodies. 



10 Detailed description 
Figures 

Figure 1: A table of preferred compositions of the invention. The numbers in the col- 
umns and rows indicate SEQ ID NOs. Each cross refers to a composition comprising 
the polypeptide (or antigenic fragment or variant thereof) of the column to which the 
15 cross belongs as well as the polypeptide (or antigenic fragment or variant thereof) of 
the row to which the cross belongs. 

Figure 2: List of amino acid sequences of surface-located Streptococcus pneumoniae 
polypeptides. 

20 

Figure 3: RT-PCR with cDNA derived from a spleen from a mouse infected with 
S. pneumoniae D39 at 1 day of infection. Primers were used specific for transcripts 
for antigens 029 (SEQ ID NO:16), 060 (SEQ ID NO:26), 607 (SEQ ID NO:20) and 
653 (SEQ ID NO:33). Moreover, primers were used specific for transcript of the 
25 Sigma 70 subunit of the pneumococcal RNA-Polymerase (house keeping gene). -RT: 
control without reverse transcriptase; +RT: RT-PCR; N: non-template control. 



Figure 4: Immunoblot with patient serum (single patient) for detection of rec. vac. (an- 
tigens) 029, 060, 144, 487, 607, 646 and 653. 

Figure 5: Immunogenicity of antigens (ags) 029, 060, 607, 653 and controls with un- 
treated animals (non immunized), Alum adjuvants alone, and an unrelated antigen at 
days 0, 21 and 35 of vaccination. 
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Figure 6: CFU 6 h after challenge with S. pneumoniae D39 in blood of mice vacci- 
nated with antigens (ags) 029, 060, 607, 653 and controls with untreated animals 
(non immunized), Alum adjuvants alone, and unrelated antigen. 

Figure 7: Survivors after challenge with S. pneumoniae D39 of mice vaccinated with 
antigens (ags) 029 and 607 compared with control groups with untreated animals 
(non immunized), Alum adjuvants alone and an unrelated antigen (sigma) 

Compositions for use as a medicament 

In a first main aspect, the invention relates to a composition comprising 

- a polypeptide which comprises a sequence selected from the group consisting 
of surface-located Streptococcus pneumoniae polypeptides of SEQ ID NO:1- 
282, or comprises an antigenic fragment or variant of said sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

- an antibody capable of binding said polypeptide, 
for use as a medicament. 

In an important embodiment, the composition comprises 

- a polypeptide which comprises a sequence selected from the group consisting 
of SEQ ID NO:1-282, or comprises an antigenic fragment or variant of said 
sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

Said composition can be used as a vaccine for active immunisation of an individual in 
need thereof. This is described in the section Vaccine compositions and methods of 
vaccination of the invention'. 

In one preferred embodiment, the composition comprises a polypeptide which 
comprises a sequence selected from the group consisting of SEQ ID NO:1-282 or 
comprises antigenic fragment or variant of said sequence. 
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In another important embodiment, the composition comprises an antibody capable of 
binding a polypeptide selected from the group consisting of surface-located 
Streptococcus pneumoniae polypeptides of SEQ ID NO:1-282. Said composition can 
5 e.g. be used in passive immunisation of an individual in need thereof. This is 
described in the section 'antibodies and methods for raising antibodies of the 
invention'. 

Vaccine compositions and methods of vaccination of the invention 

The goal of vaccination or active immunisation is to provide protective immunity by 
inducing a memory response to an infectious microorganism using an antigenic or 
immunogenic composition. Thus, a vaccine is a composition capable of inducing a 
protective immune response against a microorganism in a human being or animal. 
Such an immune response can be a cellular response and/or a humoral response, 
e.g. a specific T cell response or an antibody response. 

Accordingly, in an important embodiment, the composition is a vaccine composition. 
I.e. the invention relates to the use of a composition comprising 

a polypeptide which comprises a sequence selected from the group consisting 
20 of surface-located Streptococcus pneumoniae polypeptides of SEQ ID NO:1- 

282, or comprises an antigenic fragment or variant of said sequence, 
a polynucleotide comprising a sequence encoding said polypeptide, 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
25 expression vector, 

as a vaccine. 

The variant herein preferably has at least 95% sequence identity, such as at least 
96%, e.g. at least 97%, such as at least 98%, e.g. at least 99% sequence identity to 
said sequence. 

30 

In one preferred embodiment of the above composition, the polypeptide comprises 
SEQ ID NO:1 , or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:2, or an antigenic fragment or variant thereof. 



10 
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In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:3, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:4, or an antigenic fragment or variant thereof. 
5 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:5, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:6, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

10 comprises SEQ ID NO:7, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:8, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:9, or an antigenic fragment or variant thereof. 

15 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 10, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:1 1 , or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

20 comprises SEQ ID NO: 12, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 13, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NG:14, or an antigenic fragment or variant thereof. 

25 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 15, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:16, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

30 comprises SEQ ID NO: 17, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:18, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 19, or an antigenic fragment or variant thereof. 



WO 2006/084467 



14 



PCT/DK2006/000073 



In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:20, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:21 , or an antigenic fragment or variant thereof. 
5 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:22, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:23, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

10 comprises SEQ ID NO:24, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:25, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:26, or an antigenic fragment or variant thereof. 

15 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:27, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:28, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

20 comprises SEQ ID NO:29, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:30, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:31, or an antigenic fragment or variant thereof. 

25 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:32, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:33, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

30 comprises SEQ ID NO:34, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:35, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:36, or an antigenic fragment or variant thereof. 
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In another preferred embodiment of the above composition, 
comprises SEQ ID NO:37, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:38, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:39, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:40, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:41, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:42, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:43, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:44, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:45, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:46, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:47, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:48, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:49, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:50, or an antigenic fragment or variant thereof 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:51, or an antigenic fragment or variant thereof, 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:52, or an antigenic fragment or variant thereof, 
In another preferred embodiment of the above composition, 
comprises SEQ ID NO:53, or an antigenic fragment or variant thereof. 
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In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:54, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:55, or an antigenic fragment or variant thereof. 
5 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:56, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:57, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

10 comprises SEQ ID NO:58, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:59, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:60, or an antigenic fragment or variant thereof. 

15 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:61 , or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:62, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

20 comprises SEQ ID NO:63, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:64, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:65, or an antigenic fragment or variant thereof. 

25 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:66, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:67, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

30 comprises SEQ ID NO:68, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:69, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:70, or an antigenic fragment or variant thereof. 
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In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:71, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:72, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:73, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:74, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:75, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:76, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:77, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:78, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:79, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:80, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:81, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:82, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:83, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:84, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:85, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:86, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 

comprises SEQ ID NO:87, or an antigenic fragment or variant thereof. 
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In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:88, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:89, or an antigenic fragment or variant thereof. 
5 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:90, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:91 , or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

10 comprises SEQ ID NO:92, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:93, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:94, or an antigenic fragment or variant thereof. 

15 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:95, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:96, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

20 comprises SEQ ID NO:97, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:98, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:99, or an antigenic fragment or variant thereof. 

25 In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 100, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:101, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 

30 comprises SEQ ID NO: 102, or an antigenic fragment or variant thereof. 

In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:103, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:104, or an antigenic fragment or variant thereof. 
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In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO:105, or an antigenic fragment or variant thereof. 
In another preferred embodiment of the above composition, the polypeptide 
comprises SEQ ID NO: 168, or an antigenic fragment or variant thereof. 

A composition comprising the polypeptide of SEQ ID NO:16, or an antigenic fragment 
or variant thereof for use as a medicament is at present the most preferred 
embodiment. 

In some embodiments of the composition, the polypeptide consists of a sequence 
selected from the group of SEQ ID NO: 1-282. In other embodiments, the polypeptide 
comprises a sequence selected from the group of SEQ ID NO: 1-282 or an antigenic 
fragment or variant of said sequence, as well as a tag, such as a his-tag, i.e. a 
polyhistidine tag. 

In another preferred embodiment, the polypeptide in the composition of the invention 
may be combined with or fused to a toxin, e.g. an enterotoxigenic Escherichia coli 
Stable or Labile toxin. A suitable heat stable toxin II (STII) has been described in Lee 
et al. (1983) Infect. Immun. 42: 264-268. Examples of suitable fusion proteins are 
given in SEQ ID NO:295 and SEQ ID NO:296. In one embodiment, the combination 
comprises the polypeptide of the invention and a non-covalently linked toxin, wherein 
the toxin may be a single toxin polypeptide, or a multimeric, e.g. dimeric, form com- 
prising multiple copies of the toxin. In another embodiment, the polypeptide of the 
invention and the toxin are covalently linked, e.g. by post-translational linkage or tran- 
scription and translation from a single fused open reading frame. In either case, the 
two constituents may be linked directly or via a spacer or linker domain, which e.g. 
may be a peptide linker, preferably a protease-resistant and/or non-immunogenic 
peptide linker. Such peptide linker may be of any length, e.g. it may be between 2 and 
200, such as between 5 and 50 amino acids in length. Multiple copies of the toxin 
may be fused to the polypeptide of the invention. 

A composition comprising a polypeptide of the invention, e.g. the polypeptide of SEQ 
ID NO: 16, as well as an enterotoxigenic Escherichia coli may be used to manufacture 
a vaccine for prevention of infection with Streptococcus pneumoniae and/or entero- 
toxigenic Escherichia. 
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In further embodiments, the composition of the invention may comprise dimers of any 
of the polypeptides of SEQ ID NO:1-282, such as dimers of SEQ ID NO:16. Dimers 
may e.g. be formed by post-translational linkage or be generated from a single fused 
open reading frame. In either case, the two monomer units of the dimer may be linked 
directly or via a spacer or linker domain, which e.g. may be a peptide linker, prefera- 
bly a protease-resistant and/or non-immunogenic peptide linker. Such a peptide linker 
may be of any length, e.g. it may be between 2 and 200, such as between 5 and 50 
amino acids in length. 

The composition may only comprise one polypeptide selected from the group of SEQ 
ID NO: 1-282 or a antigenic fragment or variant thereof. However, in other 
embodiments, the composition comprises more than one polypeptide of the group of 
SEQ ID NO: 1-282 and/or more than one antigenic fragment of a polypeptide selected 
from the group of SEQ ID NO:1-282. Thus, the composition according to the invention 
may comprise more than one, such as 2, for example 3, such as 4, for example 5, 
such as 6, for example 7, such as 8, for example 9, such as 10, such as a number of 
polypeptides and/or fragments in the range of from 5 to 10, or more than 10, such as 
for example in the range of from 10 to 20, different polypeptides selected from the 
group of SEQ ID NO: 1-282 or antigenic fragments or variants thereof. 

Similarly, the composition may only comprise one polynucleotide, one ex- 
pression vector or one recombinant virus or recombinant cell of the invention. How- 
ever, in other embodiments, the composition comprises more than one polynucleo- 
tide, one expression vector or one recombinant virus or recombinant cell of the inven- 
tion. Thus, the composition according to the invention may comprise more than one, 
such as 2, for example 3, such as 4, for example 5, such as 6, for example 7, such as 
8, for example 9, such as 10, or more than 10, such as for example in the range of 
from 10 to 20, different polynucleotides, expression vectors or recombinant viruses or 
recombinant cells of the invention as described herein. 

Furthermore, in some embodiments, a recombinant cell of the invention may 
express more than one polypeptide of the group of SEQ ID NO:1-282 and/or more 
than one antigenic fragment or variant of a polypeptide selected from the group of 
SEQ ID NO:1-282. Thus, the composition according to the invention may comprise a 
recombinant cell comprising more than one, such as 2, for example 3, such as 4, for 
example 5, such as 6, for example 7, such as 8, for example 9, such as 10, such as a 
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number of polypeptides and/or antigenic fragments or variants in the range of from 5 
to 10, or more than 10, such as for example in the range of from 10 to 20, different 
polypeptides selected from the group of SEQ ID NO:1-282 or antigenic fragments or 
variants thereof. In another embodiment, the composition for use in the invention 
5 comprises multiple of the recombinant viruses or recombinant cells described herein. 

Preferably, the composition of the invention comprises one of the combinations of 
polypeptides (or antigenic fragments or variants thereof) given in Table 1. 

10 In Table 1, each of the crosses ("x") placed at the crossing of a column designated 
with a SEQ ID number with a row designated by another SEQ ID number indicates a 
composition comprising the two polypeptides of those two SEQ ID numbers (or anti- 
genic fragments or variants thereof). 

15 I.e. as an example, entirely for illustrative purposes and not intended in a limiting 
manner, the cross ("x") at the crossing of the column of SEQ ID NO:2 ("2") with the 
row of SEQ ID NO:1 ("1") indicates a composition comprising: 

- the polypeptide of SEQ ID NO:1 or an antigenic fragment or variant thereof 
and 

20 - the polypeptide of SEQ ID NO:2 or an antigenic fragment or variant thereof. 

Highly preferred compositions include: 

A composition comprising: 
25 - the polypeptide of SEQ ID NO: 16 or an antigenic fragment or variant thereof 
and 

- any of the polypeptides of SEQ ID NO: 1-282 or an antigenic fragment or variant 
thereof, preferably any of SEQ ID NO: 1-41 or an antigenic fragment or variant 
thereof, more preferably a polypeptide selected from the group consisting of SEQ ID 

30 NO:10, SEQ ID NO:13, SEQ ID NO:20 and SEQ DI NO:28, most preferably the poly- 
peptide of SEQ ID NO:20 or an antigenic fragment or variant thereof. 

A composition comprising: 

- the polypeptide of SEQ ID NO: 10 or an antigenic fragment or variant thereof 
35 and 
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- any of the polypeptides of SEQ ID NO:1-282 or an antigenic fragment or variant 
thereof, preferably any of SEQ ID NO:1-41 or an antigenic fragment or variant 
thereof, more preferably a polypeptide selected from the group consisting of SEQ ID 
NO:13, SEQ ID NO:20 and SEQ Dl NO:28, most preferably the polypeptide of SEQ 

5 ID NO:20 or an antigenic fragment or variant thereof. 

A composition comprising: 

- the polypeptide of SEQ ID NO: 13 or an antigenic fragment or variant thereof 
and 

10 - any of the polypeptides of SEQ ID NO: 1-282 or an antigenic fragment or variant 
thereof, preferably any of SEQ ID NO: 1-41 or an antigenic fragment or variant 
thereof, more preferably a polypeptide selected from the group consisting of SEQ ID 
NO:20 and SEQ Dl NO:28, most preferably the polypeptide of SEQ ID NO:20 or an 
antigenic fragment or variant thereof. 

15 

A composition comprising: 

- the polypeptide of SEQ ID NO:28 or an antigenic fragment or variant thereof 
and 

- any of the polypeptides of SEQ ID NO:1-282 or an antigenic fragment or variant 
20 thereof, preferably any of SEQ ID NO: 1-41 or an antigenic fragment or variant 

thereof, the polypeptide of SEQ ID NO:20 or an antigenic fragment or variant thereof. 

Preferred compositions comprising at least three polypeptides include the following: 

25 A composition comprising three or more polypeptides selected from the group con- 
sisting of SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:16, SEQ ID NO:20 and SEQ ID 
NO.28. 

A composition comprising: 
30 - the polypeptide of SEQ ID NO:1 6 or an antigenic fragment or variant thereof 
and 

- the polypeptide of SEQ ID NO:20 or an antigenic fragment or variant thereof 
and 

- any of the polypeptides of SEQ ID NO:1-282 or an antigenic fragment or variant 
35 thereof, preferably any of SEQ ID NO: 1-41 or an antigenic fragment or variant 
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thereof, more preferably a polypeptide selected from the group consisting of SEQ ID 
NO:10, SEQ ID NO:13 and SEQ Dl NO:28. 

Further preferred compositions according to the invention, comprising four or more 
5 polypeptides selected from the group consisting of SEQ ID NO:10, SEQ ID NO:13, 
SEQ ID NO:16, SEQ ID NO:20 and SEQ ID NO.28. 

In a yet further preferred embodiment, the composition of the invention comprises the 
five polypeptides of SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:16, SEQ ID NO:20 
10 and SEQ ID NO.28. 

In some embodiments of the above compositions comprising two or more polypep- 
tides, the polypeptides are not covalently linked. In other embodiments, however, the 
polypeptides may form a fusion polypeptide, which is formed by post-translational 
15 linkage or generated from a single fused open reading frame. In either case, the two 
or more polypeptides may be linked directly or via a spacer or linker domain, which 
e.g. may be a peptide linker, preferably a protease-resistant and/or non-immunogenic 
peptide linker. Such a peptide linker may be of any length, e.g. it may be between 2 
and 200, such as between 5 and 50 amino acids in length. 

20 

Vaccines comprising polypeptides 

As described above, in a preferred embodiment, the invention relates to a composi- 
tion comprising a polypeptide which comprises a sequence selected from the group 
consisting of SEQ ID NO: 1-282, or an antigenic fragment or variant of said sequence, 
25 for use as a vaccine. Preferred fragments and variants are those described in the 
sections herein that relate to fragments and variants. 

Accordingly, in this embodiment, the antigenicity or immunogenicity is provided by 
direct administration of a polypeptide normally located on the surface of a Streptococ- 

30 cus pneumoniae cell. In one particular embodiment, the polypeptides are selected so 
that the vaccine composition comprises multiple polypeptides capable of associating 
with different MHC molecules, such as different MHC class I molecules. Preferably, 
the composition for use as a vaccine comprises polypeptides and/or fragments capa- 
ble of associating with the most frequently occurring MHC class I molecules. In one 

35 particular embodiment of the invention, the composition comprises one or more poly- 
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peptides and/or fragments capable of associating to an MHC class I molecule and 
one or more polypeptides and/or fragments capable of associating with an MHC class 
II molecule. Hence, the vaccine composition is in some embodiments capable of rais- 
ing a specific cytotoxic T-cells response and/or a specific helper T-cell response. As- 
5 sociation to MHC molecules can e.g. be determined as described by Andersen et al. 
(1999) Tissue Antigens 54:185; or by Tan et al. (1997) J. Immunol. Methods 209:25. 

Adjuvants and immunogenic carriers 

Preferably, the composition for use as vaccine, i.e. the vaccine composition, of the 

10 present invention comprises a pharmaceutically-acceptable carrier as described 
herein in the section 'Compositions for use in the invention'. 

The composition can further comprise an adjuvant. Adjuvants are sub- 
stances whose admixture into the vaccine composition increases or otherwise modi- 
fies the immune response to a polypeptide or other antigen. Adjuvants could for ex- 

15 ample be any of: AIK(S0 4 ) 2 , AINa(S0 4 ) 2 , AINH 4 (S0 4 ), silica, alum, AI(OH) 3 , Ca 3 
(P0 4 ) 2 , kaolin, carbon, aluminium hydroxide, aluminium phosphate, muramyl dipep- 
tides, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-DMP), N-acetyl-nornuramyl-L- 
alanyl-D-iso-glutamine (CGP 11687, also referred to as nor-MDP), N-acetylmuramyul- 
L-alanyl-D-isoglutaminyl-L-alanine-2-(1'2'-dipalmitoyl-sn-glycero-3-hydroxphosphoryl 

20 oxy)~ethylamine (CGP 19835A, also referred to as MTP-PE), RIBI (MPL+TDM+CWS) 
in a 2% squalene/Tween-80.RTM. emulsion, lipopolysaccharides and derivatives, 
including lipid A, Freund's Complete Adjuvant (FCA), Freund's Incomplete Adjuvants, 
Merck Adjuvant 65, polynucleotides (for example, poly IC and poly AU acids), wax D 
from Mycobacterium, tuberculosis, substances found in Corynebacterium parvum, 

25 Bordetella pertussis, and members of the genus Brucella, liposomes or other lipid 
emulsions, Titermax, ISCOMS, Quil A, ALUN (see US 58767 and 5,554,372), Lipid A 
derivatives, choleratoxin derivatives, HSP derivatives, LPS derivatives, synthetic pep- 
tide matrixes or GMDP, Interleukin 1, Interleukin 2, Montanide ISA-51 and QS-21. 
Preferred adjuvants to be used with the invention include alum, Montanide ISA-51 

30 and QS-21. Montanide ISA-51 (Seppic, Inc.) is a mineral oil-based adjuvant analo- 
gous to incomplete Freund's adjuvant, which is normally administered as an emulsion. 
QS-21 (Antigenics; Aquila Biopharmaceuticals, Framingham, MA) is a highly purified, 
water-soluble saponin that handles as an aqueous solution. Another preferred adju- 
vant to be used in the composition of the invention is IMSAVAC-L from the Nether- 
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lands Vaccine Institute. In another preferred embodiment, the polypeptide or polypep- 
tides are included in virosomes. 

Desirable functionalities of adjuvants capable of being used in accordance 
with the present invention are listed in the below table. 



Table 1 Modes of adjuvant action 



Action 



Adjuvant type 



Benefit 



1> Immunomodulation 

2. Presentation 

3. CTL Induction 



4. Targeting 



5. Depot generation 



Generally small molecules or proteins which modify the 
cytokine network 

Generally amphipalhlc molecules or complexes which 
interact with immunogen in its native conformation 

# Particles which can bind or enclose immunogen and 
which can fuse with or disrupt cell membranes 

# w/o emulsions for direct attachment of peptide to cell 
surface MHC-1 

♦ Particulate adjuvants which bind immunogen. Adjuvants 
which saturate Kupffer cells 

, Carbohydrate adjuvants which target lectin receptors on 
macrophages and DCs 

• w/o emulsion for short term 

* Microspheres or nanospheres for long term 



Upregulation of immune response. Selection of Th1 or 
Th2 

increased neutralizing antibody response. Greater 
duration of response 

Cytosolic processing of protein yielding correct class 
1 restricted pepUdes 

Simple process if promiscuous peptide(s) known 

Efficient use of adjuvant and immunogen 

As above. May also determine type of response if 

targeting selective 

Efficiency 

Potential for single-dose vaccine 



Source: John C. Cox and Alan R. Coulter Vaccine 1997 Feb;15(3):248-56 



A vaccine composition according to the present invention may comprise more than 
10 one different adjuvant. It is also contemplated that the Streptococcus pneumoniae 
polypeptide of the invention, or one or more antigenic fragments thereof, and the ad- 
juvant can be administered separately in any appropriate sequence. 

The adjuvant of choice may e.g. Freund's complete or incomplete adjuvant, 
or killed B. pertussis organisms, used e.g. in combination with alum precipitated anti- 
15 gen. A general discussion of adjuvants is provided in Goding, Monoclonal Antibodies: 
Principles & Practice (2nd edition, 1986) at pages 61-63. Goding notes, however, that 
when the antigen of interest is of low molecular weight, or is poorly immunogenic, 
coupling to an immunogenic carrier is recommended (see below). Various saponin 
extracts and cytokines have also been suggested to be useful as adjuvants in immu- 
20 nogenic compositions. Recently, it has been proposed to use granulocyte- 
macrophage colony stimulating factor (GM-CSF), a well known cytokine, as an adju- 
vant (WO 97/28816). 



In addition, a vaccine composition of the invention can comprise an immunogenic 
25 carrier such as a scaffold structure, for example a protein or a polysaccharide, to 
which the Streptococcus pneumoniae polypeptide or the fragment thereof is capable 
of being associated. A Streptococcus pneumoniae polypeptide, or the antigenic frag- 
ment or variant thereof, present in the vaccine composition can thus be associated 
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with an immunogenic carrier such as e.g. a protein. The binding or association of the 
polypeptide to a carrier protein may be covalent or non-covalent. An immunogenic 
carrier protein may be present independently of an adjuvant. The function of a carrier 
protein can for example be to increase the molecular weight of in particular fragments 
5 in order to increase their activity or immunogenicity, to confer stability, to increase the 
biological activity, or to increase serum half-life. Furthermore, an immunogenic carrier 
protein may aid presenting the Streptococcus pneumoniae polypeptide or the frag- 
ments thereof to T cells. A carrier protein could be, but is not limited to, keyhole limpet 
hemocyanin, serum proteins such as transferrin, bovine serum albumin, human se- 

10 rum albumin, thyroglobulin or ovalbumin, immunoglobulins, or hormones, such as 
insulin. Tetanus toxoid and/or diptheria toxoid are also suitable carriers in one em- 
bodiment of the invention. Alternatively or additionally, dextrans, for example sepha- 
rose may be added. In yet another embodiment, an antigen-presenting cell such as 
e.g. a dendritic cell capable of presenting the polypeptide or a fragment thereof to a T 

15 cell may be added to obtain the same effect as a carrier protein. Methods for the 
preparation of vaccine compositions have e.g. been described in US 5,470,958 and 
references therein. 

In a further embodiment, the vaccine composition of the invention may comprise 
20 Streptococcus pneumoniae carbohydrates in addition to a polypeptide of the inven- 
tion. In one embodiment, the added carbohydrates are carbohydrates derived from or 
characteristic of one or more serotypes of Streptococcus pneumoniae. In a preferred 
embodiment, the polypeptide of the invention is combined with polysaccharides de- 
rived from or characteristic of any one or more of the serotypes given in Table 4. In a 
25 preferred embodiment, the polypeptide is combined with one or more, preferably two, 
three, four, five, six or seven polysaccharides derived from or characteristic of sero- 
type 4, 6B, 9V, 14, 18C, 19F and 23F. In another embodiment, the polypeptide is 
combined with eight or more, preferably ten or more, 15 or more, or 20 or more of the 
polysaccharide antigens of serotypes 1, 2, 3, 4, 5, 6B, 7F, 8, 9N, 10A, 11A, 12F, 14, 
30 15B, 17F, 18C, 19A, 19F, 20, 22F, 23F, and 33F. These carbohydrates may added in 
free form to the vaccine composition of the invention, or, alternatively, they may be 
fused to a polypeptide of the invention to be used in the vaccine composition. 



An effective amount of a polypeptide of the invention may be an amount capable of 
35 eliciting a detectable humoral immune response in the absence of an immunomodula- 
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tor. The appropriate amount of immunogen to be used is dependent on the immu- 
nological response it is desired to elicit. Furthermore, the exact effective amount nec- 
essary may vary from subject to subject, depending on the species, age and general 
condition of the subject, the severity of the condition being treated, the mode of ad- 
5 ministration, etc. The polypeptide vaccines of the present invention may be adminis- 
tered in various dosages, including dosages that are lower than those normally used 
for other vaccines. This possible because the polypeptides of the present invention 
are abundant on the surface of a Streptococcus pneumoniae cell and thus even a 
fairly low level of response can provide immunity. Thus, dosage of a polypeptide of 
10 the invention, when used for immunisation, may e.g. be from 0.1 to 500 micrograms 
per kilogram body weight, such as from 0.1 to 100 micrograms, e.g. from 0.1 to 50 
micrograms, such as from 0.1 to 25 micrograms, such as in the range of from 8 to 25 
micrograms per kilogram body weight, or less than that, such as from 0.1 to 5 micro- 
grams or from 0.1 to 2 micrograms per kilograms body weight. 

15 

DNA vaccine compositions and vaccine compositions comprising recombinant viruses 
or recombinant cells 

DNA or RNA vaccines pertain to the introduction of e.g. an antigenic polypeptide de- 
terminant into a patient by overexpressing in the cells of the patient, a polynucleotide 

20 construct which includes expression control sequences operably linked to a sequence 
encoding the polypeptide of interest, herein a polypeptide of any of SEQ ID NO: 1-282 
or an antigenic fragment or variant thereof, preferably the polypeptide of SEQ ID 
NO: 16 or an antigenic fragment or variant thereof. As such fragments may not contain 
a methionine start codon, such a codon is optionally included as part of the expres- 

25 sion control sequences. The polynucleotide construct may be a non-replicating and 
linear polynucleotide, a circular expression vector, or an autonomously replicating 
plasmid or viral expression vector. The construct may become integrated into the host 
genome. Any expression vector that can transfect a mammalian cell may be used in 
the methods of immunising an individual according to the present invention. Methods 

30 for constructing expression vectors are well known in the art (see, e.g., Molecular 
Cloning: A Laboratory Manual, Sambrook et al., eds., Cold Spring Harbor Laboratory, 
2nd Edition, Cold Spring Harbor, N.Y., 1989). Preferred are compositions comprising 
a plurality of genes expressing multiple polypeptides selected from SEQ ID NO:1-282 



WO 2006/084467 



PCT/DK2006/000073 



28 

and/or multiple antigenic fragments of the invention, thereby permitting simultaneous 
vaccination using a variety of preselected targets. 

Vaccines can also be prepared by incorporating a polynucleotide encoding a 
specific antigenic polypeptide of interest into a living but harmless vector, such as a 
5 virus or a cell, such as an attenuated or reduced-virulence E. coli or Salmonella cell. 
The harmless recombinant virus or recombinant cell is injected into the intended re- 
cipient. Such a recombinant cell may be dead or alive. If alive, the recombinant or- 
ganism may replicate in the host while producing and presenting the antigenic poly- 
peptide to the host's immune system. It is contemplated that this type of vaccine may 
10 be more effective than the non-replicative type of vaccine. For such a vaccine to be 
successful, the vector organism must be viable, and either be naturally non-virulent or 
have an attenuated or reduced-virulence phenotype. 

Strategies for vaccination using attenuated bacteria and suitable bacterial 
strains for use therein have been described in e.g. Makino et al. (2001) Microb. 
15 Pathog. 31:1-8; Gentschev et al. (2002) Int. J. Med. Microbiol. 291:577-582; Turner et 
al. (2001) Infect. Immun. 69:4969-4979; WO99/49026; and WO03/022307. 

Further examples of vectors that can be applied are vectors comprising e.g., 
retroviruses, as disclosed in WO 90/07936, WO 91/02805, WO 93/25234, WO 
93/25698, and WO 94/03622, adenovirus, as disclosed by Berkner, Biotechniques 

20 6:616-627, 1988; Li et al., Hum. Gene Ther. 4:403-409, 1993; Vincent et al., Nat. Ge- 
net. 5:130-134, 1993; and Kolls et al., Proc. Natl. Acad. Sci. USA 91:215-219, 1994), 
pox virus, as disclosed by U.S. 4,769,330; U.S. Pat. No. 5,017,487; and WO 
89/01973, naked DNA as disclosed WO 90/11092, a polynucleotide molecule com- 
plexed to a polycationic molecule as disclosed in WO 93/03709, and polynucleotides 

25 associated with liposomes as disclosed by Wang et al., Proc. Natl. Acad. Sci. USA 
84:7851, 1987. In certain embodiments, the DNA may be linked to killed or inacti- 
vated adenovirus as disclosed by Curiel et al., Hum. Gene Ther. 3:147-154, 1992; 
Cotton et al., Proc. Natl. Acad. Sci. USA 89:6094, 1992. Other suitable compositions 
include DNA-ligands as disclosed by Wu et al., J. Biol. Chem. 264:16985-16987, 

30 1989), and lipid-DNA combinations as disclosed by Feigner et al., Proc. Natl. Acad. 
Sci. USA 84:7413-7417, 1989). In addition, the efficiency of naked DNA uptake into 
cells may be increased by coating the DNA onto biodegradable latex beads. 

Vaccine vectors preferably comprise a suitable promoter which is operably 
linked to the polynucleotide sequence encoding the immunogenic polypeptide. Any 
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promoter that can direct a high level of transcription initiation in the target cells may 
be used in the invention. Non-tissue specific promoters, such as the cytomegalovirus 
(DeBernardi et al., Proc Natl Acad Sci USA 88:9257-9261 [1991], and references 
therein), mouse metallothionine I (Hammer et al., J Mol Appl Gen 1:273-288 [1982]), 
5 HSV thymidine kinase (McKnight, Cell 31:355-365 [1982]), and SV40 early (Benoist 
et al., Nature 290:304-310 [1981]) promoters may thus also be used. 



Methods of vaccination and use for vaccination/immunisation 

In a further main aspect, the present invention relates to the use of a composition 
10 comprising any one or more of 

a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO:1-282, or comprises an antigenic fragment or variant of said 
sequence, 

a polynucleotide comprising a sequence encoding said polypeptide, 
15 an expression vector comprising a sequence encoding said polypeptide, or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

for the preparation of a medicament for the immunisation of an animal or human 
being against bacterial infections. The immunisation preferably induces a protective 
20 immune response. In one embodiment of the above use, the medicament is only 
given once. 

In a preferred embodiment, the medicament is for the immunisation against 
Streptococcus infections. Most preferably, the medicament is for immunisation 

25 against Streptococcus pneumoniae. Immunisation with a Streptococcus pneumoniae 
polypeptide can, however, also give cross-protection to other bacterial species. This 
normally requires significant homology to at least a portion of a polypeptide of the 
other species. Such homology is e.g. found between SEQ ID NO: 16 and variants 
thereof from Streptococcus pyogenes (group A Streptococcus) (SEQ ID NO:283), 

30 Streptococcus agalactiae (group B Streptococcus) (SEQ ID NO:284) and Listeria 
monocytogenes (SEQ ID NO:285). Similarly, homology is found between SEQ ID 
NO:20 and variants thereof from Streptococcus pyogenes (group A Streptococcus) 
(SEQ ID NO:286), Streptococcus agalactiae (group B Streptococcus) (SEQ ID 
NO:287) and Listeria monocytogenes (SEQ ID NO:288). 



35 
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Accordingly, the medicament is in some embodiments used for the immunisation 
against one or more of: Streptococcus pyogenes (group A Streptococcus), 
Streptococcus agalactiae (group B Streptococcus) and Listeria monocytogenes. 
Highly preferred polypeptides for use in the preparation of such a medicament are 
SEQ ID NO:16 and SEQ ID NO:20. 

An alternative strategy for immunisation against one or more bacteria is to immunise 
with a medicament comprising the variant polypeptide. Accordingly, in a further 
embodiment, the polypeptide used for the preparation of the medicament is a variant 
of any of SEQ ID NO: 1-282, preferably a variant of SEQ ID NO: 16 and/or a variant of 
SEQ ID NO:20. Most preferably, the polypeptide is selected from the group consisting 
of SEQ ID NO:283, SEQ ID NO:284, SEQ ID NO:285, SEQ ID NO:286, SEQ ID 
NO:287 and SEQ ID NO:288, or a fragment thereof or a variant thereof, e.g. a variant 
having more than 95%, such as more than 98% sequence identity to SEQ ID NO:283, 
SEQ ID NO:284, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:287 or SEQ ID 
NO:288. 

Accordingly, in some embodiments: 

- a medicament comprising SEQ ID NO:283 and/or SEQ ID NO:286, or a fragment or 
variant of any of these two, is used to immunise against Streptococcus pyogenes 
and/or Streptococcus pneumoniae and/or other bacteria; 

- a medicament comprising SEQ ID NO:284 and/or SEQ ID NO:287, or a fragment or 
a variant of any of these two, is used to immunise against Streptococcus agalactiae 
and/or Streptococcus pneumoniae and/or other bacteria 

or 

- a medicament comprising SEQ ID NO:285 and/or SEQ ID NO:288, or a fragment or 
a variant of any of these two, is used immunise against Listeria monocytogenes 
and/or Streptococcus pneumoniae and/or other bacteria 

In the most preferred embodiment, the composition herein comprises or further 
comprises 

- a polypeptide which comprises SEQ ID NO: 16, or comprises an antigenic 
fragment or variant of SEQ ID NO: 16, 

a polynucleotide comprising a sequence encoding said polypeptide, 

an expression vector comprising a sequence encoding said polypeptide, or 
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- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

In another preferred embodiment, the composition herein comprises or further 
comprises 

- a polypeptide which comprises SEQ ID NO: 10, or comprises an antigenic 
fragment or variant of SEQ ID NO: 10, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

In another preferred embodiment, the composition herein comprises or further 
comprises 

- a polypeptide which comprises SEQ ID NO: 13, or comprises an antigenic 
fragment or variant of SEQ ID NO: 13, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

In another preferred embodiment, the composition herein comprises or further 
comprises 

- a polypeptide which comprises SEQ ID NO:28, or comprises an antigenic 
fragment or variant of SEQ ID NO:28, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

In another preferred embodiment, the composition further comprises 

- a polypeptide which comprises SEQ ID NO:20, or comprises an antigenic 
fragment or variant of SEQ ID NO:20, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 
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a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

Similarly, the invention relates to a method for the immunisation of an animal or 
5 human being against a Streptococcus pneumoniae infections comprising the step of 
administering any one or more of 

a polypeptide which comprises any of the sequences of SEQ ID NO:1-282, or 
comprises a fragment or variant of any of said sequences, 
a polynucleotide comprising a sequence encoding said polypeptide, 
10 an expression vector comprising a sequence encoding said polypeptide, or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

thereby immunising said animal or human being against Streptococcus pneumoniae 
infections. 

15 

In one embodiment of the above method for immunisation, said 

polypeptide which comprises any of the sequences of SEQ ID NO: 1-282, 
preferably SEQ ID NO:16, or comprises a fragment or variant of any of said 
sequences, 

20 - polynucleotide comprising a sequence encoding said polypeptide, 

expression vector comprising a sequence encoding said polypeptide, or 
recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

is only given once, thereby immunising said animal or human being against 
25 Streptococcus pneumoniae infections through a single administration. 

The animal may be any bird or mammal, e.g. a chicken, duck, turkey, cow or pig. 
Particular target populations of human beings may be individuals from at-risk 
populations, such as the population of children up to 4 years old, the population of 
30 elderly persons or the population of naive or semi-immune travellers to the developing 
world. 

Because the polypeptides of the present invention are immunogenic and 
because they are abundant on the Streptococcus pneumoniae cell, a protective 
immune response can be induced even patients with a reduced ability to respond to 
35 antigenic stimuli, such as juveniles, elderly patients or immunocompromised patients. 
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Furthermore, for the same reasons, the vaccines of the invention can also be used to 
prevent otitis media, to prevent nasopharyngal carriage of Streptococcus 
pneumoniae, to prevent sepsis caused by Streptococcus pneumoniae, or to prevent 
meningitis caused by Streptococcus pneumoniae. 

Thus, in one embodiment, the present invention relates to the use of any one or more 
of 

- a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO:1-282, preferably SEQ ID NO:16, or comprises an antigenic fragment 
or variant of said sequence, 

a polynucleotide comprising a sequence encoding said polypeptide, 

an expression vector comprising a sequence encoding said polypeptide, or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 

expression vector, 

for the preparation of a medicament for the immunisation of an animal or human 
being against Streptococcus pneumoniae infections, wherein said human being is a 
child of less than 4 yours of age, such as less than 2 years of age, e.g. less than 1 
year of age, and/or a child having maternal immunity (i.e. having maternal antibodies 
in circulation). 

In a further embodiment, the present invention relates to the use of any one or more 
of 

- a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO:1-282, preferably SEQ ID NO:16, or comprises an antigenic fragment 
or variant of said sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

for the preparation of a medicament for the immunisation of an animal or human 
being against Streptococcus pneumoniae infections, wherein said human being is an 
immunocompromised patient. Immunocompromised patients could e.g. patients 
taking immunosuppressive chemotherapy or patients with congenital or acquired 
immune deficiency. For the immunisation to be effective in these patients, it is 
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required that the patient still to some extent is capable of producing an immune 
response. 

In another embodiment, the present invention relates to the use of any one or more of 
5 a polypeptide which comprises a sequence selected from the group consisting of 

SEQ ID NO:1-282, preferably SEQ ID NO:16, or comprises an antigenic fragment 
or variant of said sequence, 

a polynucleotide comprising a sequence encoding said polypeptide, 
an expression vector comprising a sequence encoding said polypeptide, or 
10 a recombinant virus or recombinant cell comprising said polynucleotide or said 

expression vector, 

for the preparation of a medicament for the prevention of otitis media, in particular 
otitis media due to Streptococcus pneumoniae. 

15 In yet another embodiment, the present invention relates to the use of any one or 
more of 

a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO: 1-282, preferably SEQ ID NO: 16, or comprises an antigenic fragment 
or variant of said sequence, 
20 a polynucleotide comprising a sequence encoding said polypeptide, 

an expression vector comprising a sequence encoding said polypeptide, or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 

expression vector, 

for the preparation of a medicament for the treatment and/or prevention of 
25 nasopharyngal carriage of Streptococcus pneumoniae. 

In an even further embodiment, the present invention relates to the use of any one or 
more of 

a polypeptide which comprises a sequence selected from the group consisting of 
30 SEQ ID NO:1-282, preferably SEQ ID NO:16, or comprises an antigenic fragment 

or variant of said sequence, 

a polynucleotide comprising a sequence encoding said polypeptide, 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
35 expression vector, 
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for the preparation of a medicament for the prevention of Streptococcal meningitis. 

The vaccines may be administered in the dosages described herein by any suitable 
mode of administration, including modes of administration that result in a less than 
complete (e.g. less than 50% or less than 90%) uptake of all administered antigen. 
This is possible because the polypeptides of the present invention are sufficiently im- 
munogenic and because, due to their abundance on the surface of the Streptococcus 
pneumoniae cell, even a somewhat suboptimal response, can provide immunity. 
Thus, modes of administration of the composition according to the invention include, 
but are not limited to systemic administration, such as intravenous or subcutaneous 
administration, transdermal administration, intradermal administration, intramuscular 
administration, intranasal administration, oral administration, and generally any form 
of mucosal administration. 

An important problem relating to the production of effective Streptococcus vaccines is 
the occurrence of immunologically different types, also termed serotypes, of the bac- 
teria. These types differ considerably in their polysaccharide profile and also, albeit 
less, in some highly variable proteins. Due to such variability, vaccines known in the 
art often only work against some and not all serotypes. 

The vaccines of the present invention are based on abundant surface-located poly- 
peptides, which are not highly variable. These vaccines will be effective against a 
plurality of serotypes. Accordingly, in one embodiment, the invention relates to the 
use of any one or more of 

- a polypeptide which comprises a sequence selected from the group consisting of 
SEQ ID NO: 1-282, preferably selected from the group consisting of SEQ ID NO:1- 
41, most preferably SEQ ID NO:16, or comprises an antigenic fragment or variant 
of said sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

for the preparation of a medicament for the immunisation of an animal or human be- 
ing against more than one serotype of Streptococcus pneumoniae, such as 5 or more 
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different serotypes, e.g. 8 or more different serotypes, such as 15 or more different 
serotypes, e.g. 24 or more different serotypes. 

Preferably, said more than one serotype includes a serotype selected from the group 
of 6A, 7C, 9A, 10B, 13, 15C, 16F, 18B, 21, 23A, 24F, 28F, 31, 34, 35F, 35B, 38. 

In one preferred embodiment, the medicament is used for the immunisation against at 
least the serotypes 4, 6B, 9V y 14, 18C, 19F and 23F, and, preferably, furthermore at 
least one further serotype, said further serotype preferably being selected from the 
group of 6A, 7C, 9A, 10B, 13, 15C, 16F, 18B, 21, 23A, 24F, 28F, 31, 34, 35F, 35B, 
38. 

In another preferred embodiment, the medicament is used for the immunization 
against at least the serotypes 1, 2, 3, 4, 5, 6B, 7F, 8, 9N, 9V, 10A, 11A, 12F, 14, 15B, 
17F, 18C, 19A, 19F, 20, 22F, 23F, and 33F, and preferably at least one further sero- 
type, preferably selected from the group of 6A, 7C, 9A, 10B, 13, 15C, 16F, 18B, 21, 
23A, 24F, 28F, 31, 34, 35F, 35B, 38. 

In a further preferred embodiment, the medicament is used for the immunisation 
against any of the serotypes given in Table 4, preferably at least 5 or more different 
serotypes selected from the serotypes given in Table 4, e.g. 8 or more different sero- 
types, such as 15 or more different serotypes, e.g. 24 or more different serotypes 
selected from the serotypes given in Table 4. 

The immunogenic effect according to the present invention can e.g. be measured by 
assay of antibodies in serum samples e.g. by a RIA. Furthermore, the effect can be 
determined in vivo, by measuring e.g. an increased T-cell responsiveness to T-cell 
dependent antigenic polypeptides, wherein said increased responsiveness is charac- 
teristic of an enhancement of a normal immune response to such antigenic polypep- 
tides. An immunostimulating effect may also be measured as an enhanced T cell pro- 
duction of, in particular, IL-2, IL-3, IFN-y and/or GM-CSF. Polypeptides or fragments 
thereof having a potential for eliciting an enhanced immune response may thus be 
readily identified by screening for enhanced IL-2, IL-3, IFN-y or GM-CSF production 
by T cells, as described e.g. in US 07/779,499, incorporated herein by reference. 
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A number of aspects related to vaccination against Streptococcus pneumo- 
niae have been discussed in Bogaert et al. (2004) Vaccine 22:2209-2220. This review 
includes references to other documents describing methods for testing and evaluation 
of such vaccines. 

5 The herein described polynucleotides and expression vectors can be intro- 

duced into target cells in vivo or in vitro by any standard method: e.g., as naked DNA 
(Donnelly et al., Annu Rev Immunol 15:617-648 [1997]), incorporated into ISCOMS, 
liposomes, or erythrocyte ghosts, or by biolistic transfer, calcium precipitation, or elec- 
troporation. Alternatively, one can employ a viral-based vector as a means for intro- 

10 ducing the polynucleotide encoding the polypeptide of interest into the cells of the 
animal or human being. Preferred viral vectors include those derived from replication- 
defective hepatitis viruses (e.g., HBV and HCV), retroviruses (see, e.g., WO89/07136; 
and Rosenberg et al., N Eng J Med 323 (9):570-578 [1990]), adenovirus (see, e.g., 
Morsey et al., J Cell Biochem, Supp. 17E [1993]), adeno-associated virus (Kotin et 

15 al., Proc Natl Acad Sci USA 87:2211-2215 [1990]), replication defective herpes sim- 
plex viruses (HSV; Lu et al., Abstract, page 66, Abstracts of the Meeting on Gene 
Therapy, Sep. 22-26, 1992, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N.Y.), canary pox virus, and any modified versions of these vectors. Cells transfected 
in vitro can be cultured and cloned, if desired, prior to introduction into the patient. 

20 In addition to direct in vivo procedures, ex vivo procedures may be used in 

which cells are removed from an animal, modified, and placed into the same or an- 
other animal. It will be evident that one can utilise any of the compositions noted 
above for introduction of an antigenic polypeptides or polynucleotides encoding such 
according to the invention into tissue cells in an ex vivo context. Protocols for viral, 

25 physical and chemical methods of uptake are well known in the art. Thus, as an alter- 
native to administration of a polypeptide of the invention or a vector capable of ex- 
pressing such a polypeptide directly to the patient, one can remove helper T cells 
from the patient; stimulate those T cells ex vivo using the same antigenic polypeptide 
or vector; and introduce the stimulated helper T cells into the same patient. 

30 

Antibodies and methods for raising antibodies of the invention 

In a further main embodiment, the composition for use as a medicament comprises 
an antibody capable of binding a polypeptide selected from the group consisting of 
surface-located Streptococcus pneumoniae polypeptides of SEQ ID NO:1-282, 
35 preferably selected from the group consisting of SEQ ID NO:1-41, more preferably 
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selected from the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, 
SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably 
the polypeptide of SEQ ID NO: 16. Such a medicament can be used for antibody 
therapy, such as passive immunisation of an individual in need thereof. 

5 

Accordingly, in a further main aspect, the invention relates to antibodies capable of 
binding, preferably specifically binding, a polypeptide selected from the group consist- 
ing of SEQ ID NO:1-282 and/or a fragment and/or a variant thereof Specifically bind- 
ing' is, in this context, not intended to mean absolute specificity. The antibody may in 
10 some embodiments also specifically bind polypeptides, e.g. from other Streptococcus 
species, with a high degree of sequence identity to the polypeptide from Streptococ- 
cus pneumoniae, e.g. polypeptides with more than 90%, such as more than 95% or 
more than 98% sequence identity to the polypeptide from Streptococcus pneumoniae. 

15 In a preferred embodiment, the antibody is capable of binding, preferably specifically 
binding, a polypeptide selected from the group consisting of SEQ ID NO:1-282, such 
as the polypeptide of SEQ ID NO:1, for example the polypeptide of SEQ ID NO:2, 
such as the polypeptide of SEQ ID NO:3, for example the polypeptide of SEQ ID 
NO:4, such as the polypeptide of SEQ ID NO:5, for example the polypeptide of SEQ 

20 ID NO:6, such as the polypeptide of SEQ ID NO:7, for example the polypeptide of 
SEQ ID NO:8, such as the polypeptide of SEQ ID NO:9, for example the polypeptide 
of SEQ ID NO:10, such as the polypeptide of SEQ ID NO:11, for example the poly- 
peptide of SEQ ID NO: 12, such as the polypeptide of SEQ ID NO: 13, for example the 
polypeptide of SEQ ID NO:14, such as the polypeptide of SEQ ID NO:15, for example 

25 the polypeptide of SEQ ID NO:16, such as the polypeptide of SEQ ID NO:17, for ex- 
ample the polypeptide of SEQ ID NO:18, such as the polypeptide of SEQ ID NO:19, 
for example the polypeptide of SEQ ID NO:20, such as the polypeptide of SEQ ID 
NO:21, for example the polypeptide of SEQ ID NO:22, such as the polypeptide of 
SEQ ID NO:23, for example the polypeptide of SEQ ID NO:24, such as the polypep- 

30 tide of SEQ ID NO:25, for example the polypeptide of SEQ ID NO:26, such as the 
polypeptide of SEQ ID NO:27, for example the polypeptide of SEQ ID NO:28, such as 
the polypeptide of SEQ ID NO:29, for example the polypeptide of SEQ ID NO:30, 
such as the polypeptide of SEQ ID NO:31, for example the polypeptide of SEQ ID 
NO:32, such as the polypeptide of SEQ ID NO:33, for example the polypeptide of 

35 SEQ ID NO:34, such as the polypeptide of SEQ ID NO:35, for example the polypep- 
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tide of SEQ ID NO:36, such as the polypeptide of SEQ ID NO:37, for example the 
polypeptide of SEQ ID NO:38, such as the polypeptide of SEQ ID NO:39, for example 
the polypeptide of SEQ ID NO:40, such as the polypeptide of SEQ ID NO:41, for ex- 
ample the polypeptide of SEQ ID NO:42, such as the polypeptide of SEQ ID NO:43, 
5 for example the polypeptide of SEQ ID NO:44, such as the polypeptide of SEQ ID 
NO:45, for example the polypeptide of SEQ ID NO:46, such as the polypeptide of 
SEQ ID NO:47, for example the polypeptide of SEQ ID NO:48, such as the polypep- 
tide of SEQ ID NO:49, for example the polypeptide of SEQ ID NO:50, such as the 
polypeptide of SEQ ID NO:51 , for example the polypeptide of SEQ ID NO:52, such as 

10 the polypeptide of SEQ ID NO:53, for example the polypeptide of SEQ ID NO:54, 
such as the polypeptide of SEQ ID NO:55, for example the polypeptide of SEQ ID 
NO:56, such as the polypeptide of SEQ ID NO:57, for example the polypeptide of 
SEQ ID NO:58, such as the polypeptide of SEQ ID NO:59, for example the polypep- 
tide of SEQ ID NO:60, such as the polypeptide of SEQ ID NO:61, for example the 

15 polypeptide of SEQ ID NO:62, such as the polypeptide of SEQ ID NO:63, for example 
the polypeptide of SEQ ID NO:64, such as the polypeptide of SEQ ID NO:65, for ex- 
ample the polypeptide of SEQ ID NO:66, such as the polypeptide of SEQ ID NO:67, 
for example the polypeptide of SEQ ID NO:68, such as the polypeptide of SEQ ID 
NO:69, for example the polypeptide of SEQ ID NO:70, such as the polypeptide of 

20 SEQ ID NO:71, for example the polypeptide of SEQ ID NO:72, such as the polypep- 
tide of SEQ ID NO:73, for example the polypeptide of SEQ ID NO:74, such as the 
polypeptide of SEQ ID NO:75, for example the polypeptide of SEQ ID NO:76, such as 
the polypeptide of SEQ ID NO:77, for example the polypeptide of SEQ ID NO:78, 
such as the polypeptide of SEQ ID NO:79, for example the polypeptide of SEQ ID 

25 NO:80, such as the polypeptide of SEQ ID NO:81, for example the polypeptide of 
SEQ ID NO:82, such as the polypeptide of SEQ ID NO:83, for example the polypep- 
tide of SEQ ID NO:84, such as the polypeptide of SEQ ID NO:85, for example the 
polypeptide of SEQ ID NO:86, such as the polypeptide of SEQ ID NO:87, for example 
the polypeptide of SEQ ID NO:88, such as the polypeptide of SEQ ID NO:89, for ex- 

30 ample the polypeptide of SEQ ID NO:90, such as the polypeptide of SEQ ID NO:91, 
for example the polypeptide of SEQ ID NO:92, such as the polypeptide of SEQ ID 
NO:93, for example the polypeptide of SEQ ID NO:94, such as the polypeptide of 
SEQ ID NO:95, for example the polypeptide of SEQ ID NO:96, such as the polypep- 
tide of SEQ ID NO:97, for example the polypeptide of SEQ ID NO:98, such as the 

35 polypeptide of SEQ ID NO:99, for example the polypeptide of SEQ ID NO:100, such 
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as the polypeptide of SEQ ID NO:101, for example the polypeptide of SEQ ID 
NO:102, such as the polypeptide of SEQ ID NO:103, for example the polypeptide of 
SEQ ID NO: 104, such as the polypeptide of SEQ ID NO: 105, for example the poly- 
peptide of SEQ ID NO:168. 
5 In preferred embodiments, the antibodies of the invention are furthermore 

capable of binding an intact Streptococcus pneumoniae cell, i.e. capable of binding a 
living or a dead Streptococcus cell which has maintained its structural integrity, pref- 
erably a cell that has maintained the integrity of the membrane (i.e. wherein the 
membrane has not been permeabilised). Binding of antibodies to intact cells can e.g. 
10 be determined by flow cytometry as described in Rioux et al.(2001) Infect. Immun. 
69:5162-5165 or as described in Singh et al. (2003) Infect. Immun. 71:3937-3946. 

Preferred antibodies are ones that bind with a dissociation constant or Kd of less than 
5 X 10" 6 M, such as less than 10" 6 M, e.g. less than 5 X 10~ 7 M, such as less than 10" 7 M, 

15 e.g. less than 5 X 10~ 8 M, such as less than 10" 8 M, e.g. less than 5 X 10" 9 M, such as 
less than 10" 9 M, e.g. less than 5 X 10~ 10 M, such as less than 10" 10 M, e.g. less than 5 
X 10" 11 M, such as less than 10" 11 M, e.g. less than 5 X 10" 12 M, such as less than 10" 
12 M, e.g. less than 5 X 10" 13 M, such as less than 10" 13 M, e.g. less than 5 X 10" 14 M, 
such as less than 10" 14 M, e.g. less than 5 X 10" 15 M, or less than 10" 15 M. Binding con- 

20 stants can be determined using methods well-known in the art, such as ELISA (e.g. 
as described in Orosz and Ovadi (2002) J. Immunol. Methods 270:155-162) or sur- 
face plasmon resonance analysis. 

Antibodies can be used for passive immunisation of mammals, preferably human 
25 beings, more preferably immunocompromised patients. A treatment with antibodies 
can be done to cure or to prevent Streptococcus pneumoniae infections, including 
pneumococcal diseases, such as pneumonia or meningitis or pneumococcal sepsis. 
Preferred patient groups include children under the age of 4 years, elderly patients or 
immunocompromised patients. 

30 

Antibodies of the invention include the following preferred mechanistic groups: 
1 . Function-inhibiting antibodies that work as an antibacterial (affect the viability of 
the bacterium). Such antibodies should be effective regardless of the immune 
status of the patient. Preferably, such antibodies are capable of reducing 
35 Streptococcus pneumoniae growth in vitro to less than 50%, such as less than 
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25%, for example less than 10%, such as less than 5% of a control without 
antibody added. 

2. Opsonising antibodies that are designed to enhance phagocytic killing. 
Effectiveness of such antibodies may depend on the immune status of the patient, 
but it is very well possible that they will enhance phagocytic killing even in 
compromised patients. 

3. Antibodies conjugated to a therapeutic moiety such as a toxin or bactericidal 
agent, e g. ricin or radioisotopes. Techniques for conjugating a therapeutic moiety 
to antibodies are well known, see, e.g. Thorpe et al.(1982) Immunol. Rev. 62, 
119-158. These antibodies should also be effective regardless of the immune 
status of the patient. 

An antibody with or without a therapeutic moiety conjugated to it can be used as a 
therapeutic that is administered alone or in combination with chemotherapeutics or 
other therapeutic agents. 

In one embodiment, the antibodies of the invention are opsonising as well as function- 
inhibiting. In another embodiment, the antibodies of the invention are opsonising, but 
not function-inhibiting. The latter group of antibodies can e.g. be antibodies directed 
against a target polypeptide which is not essential for the viability of Streptococcus 
pneumoniae. 

In a further main aspect, the invention relates to a method for raising antibodies to a 
polypeptide selected from the group consisting of SEQ ID NO:1-282, in a non-human 
animal comprising the steps of 
a. providing 

- a polypeptide comprising a sequence selected from the group consisting of 
SEQ ID NO: 1-282, preferably selected from the group consisting of SEQ ID 
NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ 
ID NO:28 and SEQ ID NO:33, most preferably SEQ ID NO: 16, or comprising 
an antigenic fragment or variant of said sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, 
or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 



WO 2006/084467 PCT/DK2006/000073 

42 

b. introducing a composition comprising said polypeptide, polynucleotide, vector, 
recombinant virus or recombinant cell into said animal, 

c. raising antibodies in said animal, 

d. isolating and optionally purifying the antibodies. 

In a preferred embodiment, antibodies capable of binding an intact Streptococcus 
pneumoniae cell are identified by comprising performing the above steps and the 
further step of selecting antibodies capable of binding an intact Streptococcus 
pneumoniae cell. 

The above methods are preferably done in a transgenic animal which is capable of 
producing human antibodies. In a further preferred embodiment, the above methods 
are non-therapeutic. 

Monoclonal/polvclonal antibodies 

Antibodies of the invention may be polyclonal antibodies or monoclonal antibodies or 
mixtures of monoclonal antibodies. In a preferred embodiment, the antibody is a 
monoclonal antibody or a fragment thereof. Monoclonal antibodies (Mab's) are anti- 
bodies wherein every antibody molecule is similar and thus recognises the same epi- 
tope. The antibody may be any kind of antibody, however, it is preferably an IgG or 
IgA antibody. 

Preferred antibodies, more preferably monoclonal antibodies, are antibodies 
capable of specifically binding surface-exposed regions of the polypeptides of the 
invention. Accordingly, in a preferred embodiment of an antibody capable of binding 
SEQ ID NO:16, said antibody binds an epitope on SEQ ID NO:16 which comprises 
one or more amino acids of any of SEQ ID NO:289-SEQ ID NO:294. Even more pref- 
erably, said antibody binds an epitope which comprises two or more, such as three or 
more, e,g, four or more, such as five or more amino acids of a sequence selected 
from the group consisting of SEQ ID NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ 
ID NO:292, SEQ ID NO:293 and SEQ ID NO:294. 

Monoclonal antibodies are in general produced by a hybridoma cell line. 
Methods of making monoclonal antibodies and antibody-synthesising hybridoma cells 
are well known to those skilled in the art. Antibody-producing hybridomas may for 
example be prepared by fusion of an antibody-producing B lymphocyte with an im- 
mortalised cell line. A monoclonal antibody can be produced by the following steps. 
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An animal is immunised with an antigen such as a full-length polypeptide or a frag- 
ment thereof. The immunisation is typically accomplished by administering the anti- 
gen to an immunologically competent mammal in an immunologically effective 
amount, i.e., an amount sufficient to produce an immune response. Preferably, the 
5 mammal is a rodent such as a rabbit, rat or mouse. The mammal is then maintained 
on a booster schedule for a time period sufficient for the mammal to generate high 
affinity antibody molecules. A suspension of antibody-producing cells is removed from 
each immunised mammal secreting the desired antibody. After a sufficient time to 
generate high affinity antibodies, the animal (e.g. mouse) is sacrificed and antibody- 

10 producing lymphocytes are obtained from one or more of the lymph nodes, spleens 
and peripheral blood. Spleen cells are preferred, and can be mechanically separated 
into individual cells in a physiological medium using methods well known to one of 
skill in the art. The antibody-producing cells are immortalised by fusion to cells of a 
mouse myeloma line. Mouse lymphocytes give a high percentage of stable fusions 

15 with mouse homologous myelomas, however, rat, rabbit and frog somatic cells can 
also be used. Spleen cells of the desired antibody-producing animals are immortal- 
ised by fusing with myeloma cells, generally in the presence of a fusing agent such as 
polyethylene glycol. Any of a number of myeloma cell lines suitable as a fusion part- 
ner can be, for example, the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or Sp2/0-Ag14 mye- 

20 loma lines, available from the American Type Culture Collection (ATCC), Rockville, 
Md. 

Monoclonal antibodies can also be generated by other methods well known 
to those skilled in the art of recombinant DNA technology. An alternative method, re- 
ferred to as the "combinatorial antibody display" method, has been developed to iden- 

25 tify and isolate antibody fragments having a particular specificity, and can be utilised 
to produce monoclonal antibodies. 

A polyclonal antibody is a mixture of antibody molecules recognising a spe- 
cific given antigen, hence polyclonal antibodies may recognise different epitopes 
within e.g. a polypeptide. In general polyclonal antibodies are purified from serum of a 

30 mammal, which previously has been immunised with the antigen. Polyclonal antibod- 
ies may for example be prepared by any of the methods described in Antibodies: A 
Laboratory Manual, By Ed Harlow and David Lane, Cold Spring Harbor Laboratory 
Press, 1988. Polyclonal antibodies may be derived from any suitable mammalian 
species, for example from mice, rats, rabbits, donkeys, goats, and sheep. 
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Specificity 

The antibodies of the invention may be monospecific towards any of the polypeptides 
of SEQ ID NO:1-282. In another embodiment, the antibody is bispecific or multispeci- 
fic having at least one portion being specific towards any of the polypeptides of SEQ 
5 ID NO:l-282. 

Monospecific antibodies may be monovalent, i.e. having only one binding 
domain. For a monovalent antibody, the immunoglobulin constant domain amino-acid 
sequences preferably comprise the structural portions of an antibody molecule known 
in the art as CH1, CH2, CHS and CH4. Preferred are those which are known in the art 

10 as C u . Furthermore, insofar as the constant domain can be either a heavy or light 
chain constant domain (C H or C L , respectively), a variety of monovalent antibody 
compositions are contemplated by the present invention. For example, light chain 
constant domains are capable of disulphide bridging to either another light chain con- 
stant domain, or to a heavy chain constant domain. In contrast, a heavy chain con- 

15 stant domain can form two independent disulphide bridges, allowing for the possibility 
of bridging to both another heavy chain and to a light chain, or to form polymers of 
heavy chains. Thus, in another embodiment, the invention contemplates a composi- 
tion comprising a monovalent polypeptide wherein the constant chain domain C has a 
cysteine residue capable of forming at least one disulphide bridge, and where the 

20 composition comprises at least two monovalent polypeptides covalently linked by said 
disulphide bridge. 

In another embodiment of the invention the antibody is a multivalent antibody 
having at least two binding domains. The binding domains may have specificity for 
the same ligand or for different ligands. 

25 

Multispecificitv, including bispecificitv 

In a preferred embodiment the invention relates to multispecific antibodies, which 
have affinity for and are capable of specifically binding at least two different entities. 

In one embodiment, the multispecific antibody is a bispecific antibody, which 

30 carries at least two different binding domains, at least one of which is of antibody ori- 
gin. A bispecific molecule of the invention can also be a single chain bispecific mole- 
cule. Multispecific molecules can also be single-chain molecules or may comprise at 
least two single-chain molecules. The multispecific, including bispecific antibodies, 
may be produced by any suitable manner known to the person skilled in the art. A 

35 number of approaches have been developed such as the ones described in WO 
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94/09131; WO 94/13804; WO 94/13806 or U.S. Pat. Nos. 5,260,203; 5,455,030; 
4,881,175; 5,132,405; 5,091,513; 5,476,786; 5,013,653; 5,258,498; and 5,482,858. 
Using a bispecific or multispecific antibody according to the invention the invention 
offers several advantages as compared to monospecific/monovalent antibodies. A 
5 bispecific/multispecific antibody has a first binding domain capable of specifically rec- 
ognising and binding any of the Streptococcus pneumoniae polypeptides of SEQ ID 
NO: 1-282, whereas the other binding domain(s) may be used for other purposes. In 
one embodiment, at least one other binding domain is used for binding to a Strepto- 
coccus pneumoniae polypeptide, such as binding to another epitope on the same 

10 Streptococcus pneumoniae polypeptide as the first binding domain. Thereby specific- 
ity for Streptococcus pneumoniae may be increased as well as increase of avidity of 
the antibody. In another embodiment the at least one other binding domain may be 
used for specifically binding a mammalian cell, such as a human cell. It is preferred 
that the at least other binding domain is capable of binding an immunoactive cell, 

15 such as a leukocyte, a macrophage, a lymphocyte, a basophilic cell, and/or an eosi- 
nophilic cell, in order to increase the effect of the antibody in a therapeutic method. 
This may be accomplished by establishing that the at least one other binding domain 
is capable of specifically binding a mammalian protein, such as a human protein, 
such as a protein selected from any of the cluster differentiation proteins (CD), in par- 

20 ticular CD64 and/or CD89. 

Humanised antibodies 

It is not always desirable to use non-human antibodies for human therapy, since the 
non-human "foreign" epitopes may elicit an immune response in the individual to be 

25 treated. To eliminate or minimise the problems associated with non-human antibod- 
ies, it is desirable to engineer chimeric antibody derivatives, i.e., "humanised" anti- 
body molecules that combine the non-human Fab variable region binding determi- 
nants with a human constant region (Fc). Such antibodies are characterised by 
equivalent antigen specificity and affinity of the monoclonal and polyclonal antibodies 

30 described above, and are less immunogenic when administered to humans, and 
therefore more likely to be tolerated by the individual to be treated. 

Accordingly, in one embodiment the antibody of the invention is a humanised 
antibody. Humanised antibodies are in general chimeric antibodies comprising re- 
gions derived from a human antibody and regions derived from a non-human anti- 

35 body, such as a rodent antibody. Humanisation (also called Reshaping or CDR- 
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grafting) is a well-established technique for reducing the immunogenicity of mono- 
clonal antibodies (mAbs) from xenogeneic sources (commonly rodent), increasing the 
homology to a human immunoglobulin, and for improving their activation of the hu- 
man immune system. Thus, humanised antibodies are typically human antibodies in 
5 which some CDR residues and possibly some framework residues are substituted by 
residues from analogous sites in rodent antibodies. 

It is important that humanised antibodies retain high affinity for the antigen 
and other favourable biological properties. To achieve this goal, according to a pre- 
ferred method, humanised antibodies are prepared by a process of analysis of the 

10 parental sequences and various conceptual humanised products using three- 
dimensional models of the parental and humanised sequences. Three-dimensional 
immunoglobulin models are commonly available and are familiar to those skilled in 
the art. Computer programs are available which illustrate and display probable three- 
dimensional conformational structures of selected candidate immunoglobulin se- 

15 quences. Inspection of these displays permits analysis of the likely role of certain 
residues in the functioning of the candidate immunoglobulin sequence, i.e., the analy- 
sis of residues that influence the ability of the candidate immunoglobulin to bind its 
antigen. In this way, FR residues can be selected and combined from the recipient 
and import sequences so that the desired antibody characteristic, such as increased 

20 affinity for the target antigen(s), is maximised, although it is the CDR residues that 
directly and most substantially influence antigen binding. 

One method for humanising MAbs relates to production of chimeric antibod- 
ies in which an antigen binding site comprising the complete variable domains of one 
antibody is fused to constant domains derived from a second antibody, preferably a 

25 human antibody. Methods for carrying out such chimerisation procedures are for ex- 
ample described in EP-A-0 120 694 (Celltech Limited), EP-A-0 125 023 (Genentech 
Inc.), EP-A-0 171 496 (Res. Dev. Corp. Japan), EP-A-01 73494 (Stanford University) 
and EP-A-0 194 276 (Celltech Limited). 

The humanised antibody of the present invention may be made by any 

30 method capable of replacing at least a portion of a CDR of a human antibody with a 
CDR derived from a non-human antibody. Winter describes a method which may be 
used to prepare the humanised antibodies of the present invention (UK Patent Appli- 
cation GB 2188638A), the contents of which are incorporated by reference. 

As an example, the humanised antibodies of the present invention may be 

35 produced by the following process: 
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(a) constructing, by conventional techniques, an expression vector containing an op- 
era n with a DNA sequence encoding an antibody heavy chain in which the CDRs 
and such minimal portions of the variable domain framework region that are re- 
quired to retain antibody binding specificity are derived from a non-human immu- 

5 noglobulin, and the remaining parts of the antibody chain are derived from a hu- 

man immunoglobulin; 

(b) constructing, by conventional techniques, an expression vector containing an op- 
eron with a DNA sequence encoding a complementary antibody light chain in 
which the CDRs and such minimal portions of the variable domain framework re- 

10 gion that are required to retain donor antibody binding specificity are derived from 

a non-human immunoglobulin, and the remaining parts of the antibody chain are 
derived from a human immunoglobulin; 

(c) transfecting the expression vectors into a host cell by conventional techniques; 
and 

15 (d) culturing the transfected cell by conventional techniques to produce the human- 
ised antibody. 

The host cell may be co-transfected with the two vectors of the invention, the first vec- 
tor containing an operon encoding a light chain derived polypeptide and the second 

20 vector containing an operon encoding a heavy chain derived polypeptide. The two 
vectors contain different selectable markers, but otherwise, apart from the antibody 
heavy and light chain coding sequences, are preferably identical, to ensure, as far as 
possible, equal expression of the heavy and light chain polypeptides. Alternatively, a 
single vector may be used, the vector including the sequences encoding both the light 

25 and the heavy chain polypeptides. The coding sequences for the light and heavy 
chains may comprise cDNA or genomic DNA or both. 

The host cell used to express the altered antibody of the invention may be ei- 
ther a bacterial cell such as Escherichia coli, or a eukaryotic cell. In particular a 
mammalian cell of a well defined type for this purpose, such as a myeloma cell or a 

30 Chinese hamster ovary cell may be used. 

The general methods by which the vectors of the invention may be con- 
structed, transfection methods required to produce the host cell of the invention and 
culture methods required to produce the antibody of the invention from such host cells 
are all conventional techniques. Likewise, once produced, the humanised antibodies 

35 of the invention may be purified according to standard procedures. 
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Human antibodies 

In a more preferred embodiment the invention relates to an antibody, wherein the 
binding domain is carried by a human antibody, i.e. wherein the antibodies have a 
5 greater degree of human peptide sequences than do humanised antibodies. 

Human mAb antibodies directed against human proteins can be generated 
using transgenic mice carrying the human immune system rather than the mouse 
system. Splenocytes from these transgenic mice immunised with the antigen of inter- 
est are used to produce hybridomas that secrete human mAbs with specific affinities 

10 for epitopes from a human protein (see, e.g., Wood et al. International Application 
WO 91/00906, Kucherlapati et al. PCT publication WO 91/10741; Lonberg et al. In- 
ternational Application WO 92/03918; Kay et al. International Application 92/03917; 
Lonberg, N. et al. 1994 Nature 368:856-859; Green, L. L. et al. 1994 Nature Genet. 
7:13-21; Morrison, S. L et al. 1994 Proc. Natl. Acad. Sci. USA 81:6851-6855; Brug- 

15 geman et al. 1993 Year Immunol 7:33-40; Tuaillon et al. 1993 PNAS 90:3720-3724; 
Bruggeman et al. 1991 Eur J Immunol 21:1323-1326). Such transgenic mice are 
available from Abgenix, Inc., Fremont, Calif., and Medarex, Inc., Annandale, N.J. It 
has been described that the homozygous deletion of the antibody heavy-chain joining 
region (IH) gene in chimeric and germ-line mutant mice results in complete inhibition 

20 of endogenous antibody production. Transfer of the human germ-line immunoglobulin 
gene array in such germ-line mutant mice will result in the production of human anti- 
bodies upon antigen challenge. See, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. 
USA 90:2551 (1993); Jakobovits et al., Nature 362:255-258 (1993); Bruggemann et 
al., Year in Immunol. 7:33 (1993); and Duchosal et al. Nature 355:258 (1992). Human 

25 antibodies can also be derived from phage-display libraries (Hoogenboom et al ., J. 
Mol. Biol. 227: 381 (1992); Marks et al., J. Mol. Biol. 222:581-597 (1991); Vaughan, et 
al., Nature Biotech 14:309 (1996)). 

A preferred method for the isolation of high affinity antibodies is a subtractive 
30 procedure were human antibodies or antibody fragments against the targets, in 
particular against the antigens 029 (SEQ ID NO:16) and 607 (SEQ ID NO:20), in their 
native configuration can be rapidly obtained from a phage antibody library (see, e.g. 
De Kruif et al., Proc. Natl. Acad. Sci. USA 92:3938-3942 (1995); US patent 
62651 50;and US patent applications 2002132228 and 2005043521). The phage 
35 antibody libraries can e.g. be constructed using antibody producing cells from patients 
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with the disease of interest, here patients infected with Streptococcus pneumoniae. 
The genes coding for the antibodies produced by these cells may be cloned into a 
semi-synthetic phage antibody library using degenerated oligonucleotides rearranging 
the CDR3 Region of the cloned genes. Afterwards the library is incubated with the 
5 target antigen or target-expressing cells, here Streptococcus pneumoniae, and the 
phage antibodies bound to the target are isolated by using standard methods. The 
present invention is also directed to antibodies identified by the above procedure, in 
particular antibodies capable of binding a polypeptide selected from group consisting 
of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, 
10 SEQ ID NO:28 and SEQ ID NO:33, preferably antibodies having a dissociation 
constant or Kd of less than 10" 7 M, e.g. less than 10" 8 M, such as less than 1CT 9 M, e.g. 
less than than 10" 10 M, such as less than 10" 11 M and/or antibodies binding a surface- 
exposed epitope of any of these targets. 

15 Suitable methods for producing human monoclonal antibodies have further- 

more been described in WO 03/017935, WO 02/100348, US 2003 091561, and US 
2003 194403. 

Binding fragments of antibodies 

20 In one embodiment of the invention, the antibody is a fragment of an antibody, pref- 
erably an antigen binding fragment or a variable region. Examples of antibody frag- 
ments useful with the present invention include Fab, Fab 1 , F(ab') 2 and Fv fragments. 
Papain digestion of antibodies produces two identical antigen binding fragments, 
called the Fab fragment, each with a single antigen binding site, and a residual "Fc" 

25 fragment, so-called for its ability to crystallise readily. Pepsin treatment yields an 
F(ab') 2 fragment that has two antigen binding fragments which are capable of cross- 
linking antigen, and a residual other fragment (which is termed pFc'). Additional frag- 
ments can include diabodies, linear antibodies, single-chain antibody molecules, and 
multispecific antibodies formed from antibody fragments. 

30 The antibody fragments Fab, Fv and scFv differ from whole antibodies in that 

the antibody fragments carry only a single antigen-binding site. Recombinant frag- 
ments with two binding sites have been made in several ways, for example, by 
chemical cross-linking of cysteine residues introduced at the C-terminus of the VH of 
an Fv (Cumber et al., 1992), or at the C-terminus of the VL of an scFv (Pack and 
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Pluckthun, 1992), or through the hinge cysteine residues of Fab's (Carter et al., 
1992). 

Preferred antibody fragments retain some or essentially all of the ability of an 
antibody to selectively binding with its antigen. Some preferred fragments are defined 
5 as follows: 

(1) Fab is the fragment that contains a monovalent antigen-binding fragment of an 
antibody molecule. A Fab fragment can be produced by digestion of whole anti- 
body with the enzyme papain to yield an intact light chain and a portion of one 
heavy chain. 

10 (2) Fab' is the fragment of an antibody molecule and can be obtained by treating 

whole antibody with pepsin, followed by reduction, to yield an intact light chain 
and a portion of the heavy chain. Two Fab' fragments are obtained per antibody 
molecule. Fab' fragments differ from Fab fragments by the addition of a few resi- 
dues at the carboxyl terminus of the heavy chain CH1 domain including one or 

1 5 more cysteines from the antibody hinge region. 

(3) (Fab') 2 is the fragment of an antibody that can be obtained by treating whole anti- 
body with the enzyme pepsin without subsequent reduction. F(ab') 2 is a dimer of 
two Fab' fragments held together by two disulfide bonds. 

(4) Fv is the minimum antibody fragment that contains a complete antigen recognition 
20 and binding site. This region consists of a dimer of one heavy and one light chain 

variable domain in a tight, non-covalent association (V H -V L dimer). It is in this 
configuration that the three CDRs of each variable domain interact to define an 
antigen binding site on the surface of the V H -V L dimer. Collectively, the six CDRs 
confer antigen binding specificity to the antibody. However, even a single variable 
25 domain (or half of an Fv comprising only three CDRs specific for an antigen) has 

the ability to recognise and bind antigen, although at a lower affinity than the en- 
tire binding site. 

In one embodiment of the present invention the antibody is a single-chain antibody, 
30 defined as a genetically engineered molecule containing the variable region of the 
light chain, the variable region of the heavy chain, linked by a suitable polypeptide 
linker as a genetically fused single chain molecule. Such single-chain antibodies are 
also referred to as "single-chain Fv" or "sFv" antibody fragments. Generally, the Fv 
polypeptide further comprises a polypeptide linker between the V H and V L domains 
35 that enables the sFv to form the desired structure for antigen binding. 
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The antibody fragments according to the invention may be produced in any 
suitable manner known to the person skilled in the art. Several microbial expression 
systems have already been developed for producing active antibody fragments, e.g. 
the production of Fab in various hosts, such as E. coli or yeast has been described. 
5 The fragments can be produced as Fab's or as Fv's, but additionally it has been 
shown that a Vh and a V L can be genetically linked in either order by a flexible poly- 
peptide linker, which combination is known as an scFv. 

1 0 Compositions for use in the invention 

In a preferred embodiment of the composition for use as a medicament, said 
composition comprises, in addition to the active component, a pharmaceutically- 
acceptable carrier. 

15 As used herein, the term "pharmaceutically acceptable" used in connection with com- 
positions or carriers represents that the materials are capable of being administered 
to or upon a human or animal without the production of undesirable physiological ef- 
fects such as nausea, dizziness, gastric upset and the like. 

20 The preparation of a composition that contains active ingredients dissolved or dis- 
persed therein is well understood in the art. Often such compositions are prepared as 
sterile injectables either as liquid solutions or suspensions, aqueous or non-aqueous, 
however, solid forms suitable for solution, or suspension, in liquid prior to use can 
also be prepared. The preparation can also be emulsified. The active ingredient can 

25 be mixed with carriers which are pharmaceutically acceptable and compatible with the 
active ingredient and in amounts suitable for use in the methods described herein. 
Suitable carriers are, for example, water, saline, dextrose, glycerol, ethanol or the like 
and combinations thereof. In addition, if desired, the composition can contain minor 
amounts of auxiliary substances such as wetting or emulsifying agents, pH-buffering 

30 agents and the like which enhance the effectiveness of the active ingredient. 

The compositions of the present invention can include pharmaceutically-acceptable 
salts of the active components therein. Pharmaceutically acceptable salts include the 
acid addition salts (formed with the free amino groups of the polypeptide) that are 
35 formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or 
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such organic acids as acetic, tartaric, mandelic and the like. Salts formed with the free 
carboxyl groups can also be derived from inorganic bases such as, for example, so- 
dium, potassium, ammonium, calcium or ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine and the like. 
5 Pharmaceutically-acceptable carriers are well known in the art. Exemplary of 

liquid carriers are sterile aqueous solutions that contain no materials in addition to the 
active ingredients and water, or contain a buffer such as sodium phosphate at physio- 
logical pH value, physiological saline or both, such as phosphate-buffered saline. Still 
further, aqueous carriers can contain more than one buffer salt, as well as salts such 
10 as sodium and potassium chlorides, dextrose, propylene glycol, polyethylene glycol 
and other solutes. Liquid compositions can also contain liquid phases in addition to 
and to the exclusion of water. Exemplary of such additional liquid phases are glycerin, 
vegetable oils such as cottonseed oil, organic esters such as ethyl oleate, and water- 
oil emulsions. 

15 

The composition may also be a kit-in-part further including an antibiotic agent, such 
as antibiotics selected from vancomycin, (3-lactams, cephalosporins, penicilins, ami- 
noglycosides, macrolide antibiotics (erythromycin, clarithromycin, or azithromycin) 
and fluoroquinolone antibiotics (ciprofloxacin, levofloxacin, gatifloxacin, or moxiflox- 
20 acin) and/or including an immunostimulating agent, such as cytokines, interferons, 
growth factors, for example GCSF or GM-CSF. The kit-in-part may be used for simul- 
taneous, sequential or separate administration. 

The invention furthermore relates to pharmaceutical compositions useful for practising 
25 the methods described herein. Thus, the invention relates to a pharmaceutical 
composition comprising a pharmaceutically-acceptable carrier and 

an isolated polypeptide which comprises any of the sequences of SEQ ID NO:1- 
282, preferably selected from the group consisting of SEQ ID NO:16, SEQ ID 
NO:10, SEQ ID NO:13, SEQ ID NO:20, and SEQ ID NO:28, most preferably SEQ 
30 ID NO: 16, or comprises a fragment or variant of any of said sequences, 

an isolated polynucleotide comprising a sequence encoding said polypeptide, 
an expression vector comprising a sequence encoding said polypeptide, 
or 

a recombinant virus or recombinant cell comprising said polynucleotide or said 
35 expression vector. 
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Furthermore, the invention relates to a pharmaceutical composition comprising an 
antibody of the invention, preferably an antibody capable of binding a polypeptide 
selected from the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, 
5 SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably 
SEQ ID NO: 16, as defined herein and a pharmaceutically-acceptable carrier. 

Polypeptides of the invention 

Fragments of the invention 

10 In a further aspect, the invention relates to a fragment, preferably an antigenic frag- 
ment, of a polypeptide set forth in any of SEQ ID NO:1-282, preferably SEQ ID 
NO:16. Antigenicity can be predicted by various methods known in the art. The length 
of such fragments may vary from 2 consecutive amino-acid residues of a polypeptide 
to the full-length polypeptide minus one amino-acid residue. Preferably, fragments are 

15 less than 100 consecutive amino acids, such as less than 70 or 50 consecutive amino 
acids, e.g. less than consecutive 40 amino acids, such as less than 30 consecutive 
amino acids, e.g. less than 25 consecutive amino acids, such as less than consecu- 
tive 20 amino acids in length. Thus, for example fragments can be 
2,3,4,5,6,7,8,9,10,11,12,13,14, 15,16,17,18,19 or 20 consecutive amino acids in 

20 length. In further preferred embodiments, a fragment comprises 6 or more, such as 7 
or more, e.g. 8 or more, such as 9 or more, e.g. 10 or more consecutive amino acids 
of the corresponding full-length sequence. Preferred ranges include fragments of be- 
tween 5 and 50 consecutive amino acids in length, such as between 5 and 25 con- 
secutive amino acids in length, e.g. between 5 and 20 consecutive amino acids in 

25 length. Expressed in another way, a fragment consists of a part of an amino-acid se- 
quence which is less than 100% in length as compared to the full-length polypeptide. 
Preferably, the length of the fragment is less than 99%, such as less than 75%, e.g. 
less than 50%, such as less than 25%, e.g. less than 20%, such as less than 15%, 
e.g. less than 10% of the length of the full-length polypeptide. In further preferred em- 

30 bodiments, the fragment consists of a part of an amino-acid sequence which is less 
than 100%, but more than 1% in length as compared to the full-length polypeptide, 
such as less than 100% but more than 5%, e.g. less than 100% but more than 10%, 
such as less than 100% but more than 20%, e.g. less than 100% but more than 25%, 
such as less than 100% but more than 50% of the length of the full-length polypep- 

35 tide. 
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Preferably, fragments of the invention are surface-exposed in an intact Strep- 
tococcus pneumoniae cell or other cell when expressed recombinantly therein. Sur- 
face-exposure can be e.g. be determined using a monoclonal antibody specific for 
said fragment, e.g. as described in Singh et al. (2003) Infect. Immun. 71:3973-3946. 
5 Also preferred are fragments which are capable of inducing antibodies that can spe- 
cifically bind an intact Streptococcus pneumoniae cell. This can be determined by 
generating monoclonal antibodies using said fragment and subsequent characterisa- 
tion of the binding of individual antibodies to intact cells, e.g. as described in Singh et 
al. (2003) Infect. Immun. 71:3973-3946. Preferred fragments of SEQ ID NO:16 in- 
10 elude fragments comprising or consisting of one or more of the sequences of SEQ ID 
NO:289-SEQ ID NO:294, more preferred fragments include fragments comprising or 
consisting of SEQ ID NO:289 and/or SEQ ID NO:290, fragments comprising or con- 
sisting of SEQ ID NO: 291 and/or SEQ ID NO: 292, and fragments comprising or con- 
sisting of SEQ ID NO:293 and/or SEQ ID NO:294. 

15 

The full-length polypeptides of SEQ ID NO: 1-282 as well as the fragments of the in- 
vention can be produced recombinantly by conventional techniques known in the art. 
Suitable host cells can be mammalian cells, e.g. CHO, COS or HEK293 cells. Alter- 
natively, insect cells, bacterial cells or fungal cells can be used. Methods for heterolo- 
20 gous expression of polynucleotide sequences in the cell types listed above and sub- 
sequent purification of the produced polypeptides, e.g. using a tag sequence such as 
a histidine tag, which may be removed after purification, are well-known to those 
skilled in the art. Alternatively, fragments of the invention can be produced syntheti- 
cally. 

25 

Variants of the invention 

In a further main aspect, the invention relates to the use of variants of any of the 
polypeptides set forth in SEQ ID NO: 1-282, preferably SEQ ID NO: 16, or variants of 
fragments of any of the polypeptides set forth in SEQ ID NO: 1-282, preferably SEQ 
30 ID NO:16, in a composition for use as a medicament. When used herein, phrases 
such as 'a polypeptide having at least 95% sequence identity to SEQ ID NO:X' are 
used interchangeably with, and are intended to be directed to the same subject- 
matter as, phrases such as 'the polypeptide of SEQ ID NO:X and variants thereof, 
wherein the variant has at least 95% sequence identity to said sequence.' 



35 
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Variants preferably have at least 75% sequence identity, for example at least 80% 
sequence identity, such as at least 85% sequence identity, for example at least 90% 
sequence identity, such as at least 91% sequence identity, such as at least 92% 
sequence identity, for example at least 93% sequence identity, such as at least 94% 
5 sequence identity, for example at least 95% sequence identity, such as at least 96% 
sequence identity, for example at least 97% sequence identity, such as at least 98% 
sequence identity, for example 99% sequence identity with the given polypeptide or 
fragment. Sequence identity is determined with any of the algorithms GAP, BESTFIT, 
or FASTA in the Wisconsin Genetics Software Package Release 7.0, using default 
1 0 gap weights. 

Preferred variants of a given polypeptide or fragment are variants in which all amino- 
acid substitutions between the variant and the given reference polypeptide or frag- 
ment are conservative substitutions. Conservative amino-acid substitutions refer to 

15 the interchangeability of residues having similar side chains. For example, a group of 
amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and iso- 
leucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and 
threonine, a group of amino acids having amide-containing side chains is asparagine 
and glutamine; a group of amino acids having aromatic side chains is phenylalanine, 

20 tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, 
arginine, and histidine; and a group of amino acids having sulfur-containing side 
chains is cysteine and methionine. Preferred conservative amino-acids substitution 
groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine- 
valine, and asparagine-glutamine. 

25 Variants of a polypeptide or of a fragment thereof also include forms of the 

polypeptide or fragment wherein one or more amino acids have been deleted or 
inserted. Preferably, less than 5, such as less than 4, e.g. less than 3, such as less 
than 2, e.g. only one amino acid has been inserted or deleted. Variants' of a 
polypeptide or of a fragment thereof also include forms of these polypeptides or 

30 fragments modified by post-translational modifications of the amino-acid sequence. 

Recombinant cells of the invention 

In a further main aspect, the invention relates to the use of a recombinant cell 
transformed or transfected with a polynucleotide comprising a sequence encoding a 
35 polypeptide, said polypeptide comprising a sequence selected from the group 
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consisting of SEQ ID NO:1-282, preferably SEQ ID NO:16, or comprising an antigenic 
fragment or variant of said sequence. Preferably, said recombinant cell is an 
Escherichia coli or Salmonella cell, more preferably an attenuated or reduced- 
virulence Escherichia or Salmonella cell. 
5 Suitable bacterial strains for use herein have been described in e.g. Makino 

et al. (2001) Microb. Pathog. 31:1-8; Gentschev et al. (2002) Int. J. Med. Microbiol. 
291:577-582; Turner et al. (2001) Infect. Immun. 69:4969-4979; WO99/49026; and 
WO03/022307 and references therein. Examples of suitable Salmonella strains are 
CvD908-T7pol (Santiago-Machuca et al. (2002) Plasmid 47:108-119), ATCC 39183, 
10 ATCC 53647 and ATCC 53648. Examples of suitable E. coli strains are YT106 and 
E 1392/75-2 A. 

Methods and uses of the invention 

The compositions and other products defined above can be used to treat or prevent 
15 Streptococcus pneumoniae infections, and/or disease resulting from such infections, 
in animals or human beings in need thereof. 

Treatment and prevention herein include all types of therapeutic treatment and pre- 
ventive treatment and other treatments to combat Streptococcus pneumoniae, includ- 
20 ing but not limited to vaccination, prophylaxis, active immunisation, passive immuni- 
sation, administration of antibodies, curative treatment, ameliorating treatment. In 
particular, passive immunisation using antibodies of the invention is a suitable treat- 
ment for immunocompromised individuals. 

25 Diagnostic methods of the invention 

The combination of being surface-exposed and being present in relatively high copy 
numbers in cells also makes the polypeptides identified by the inventors highly suit- 
able as targets for detection of Streptococcus pneumoniae, allowing detection of this 
organism with high sensitivity. 

30 

Accordingly, in a further main aspect, the invention relates to a method for detecting 
Streptococcus pneumoniae or parts thereof in a sample comprising the steps of 
a. contacting said sample with an indicator moiety capable of specifically binding a 
polypeptide selected from the group consisting of SEQ ID NO: 1-282, preferably 
35 selected from the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID 
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NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most 
preferably SEQ ID NO: 16 , and 
b. determining whether a signal has been generated by the indicator moiety, thereby 
detecting whether said sample contains Streptococcus pneumoniae or parts 
5 thereof. 

Preferably, said indicator moiety is capable of binding, preferably specifically binding, 
intact Streptococcus pneumoniae cells. 

In preferred embodiments of the above diagnostic methods, a washing step is 
10 performed between the contacting step and the determination step, in order to 
improve the specificity of detection. 

The sample can e.g. be faeces, urine, a tissue, tissue extract, fluid sample or body 
fluid sample, such as blood, plasma, serum, sputum, or a sample taken from nose or 
15 lung. Another example of a sample is a food sample, such as a meat sample. 

In another aspect, the invention relates to a method for detecting Streptococcus 
pneumoniae or parts thereof in a sample comprising the step of analysing a sample 
by mass spectrometry to evaluate the presence and/or quantity of one or more of the 
20 polypeptides of SEQ ID NO:1-282, in particular SEQ ID NO:16 and/or SEQ ID NO:20. 
In one embodiment, the sample, e.g. a blood sample, is pre-treated to enrich for the 
polypeptide(s) to be detected. Such a pre-treatment may include a size-fractionation 
of proteins present in the sample. 

25 The above methods can e.g. be used to diagnose Streptococcus pneumoniae 
infections in an individual. In preferred embodiments of the above methods, said 
indicator moiety does not pass through the membrane of a Streptococcus 
pneumoniae cell. A preferred type of said indicator moiety consists of or comprises an 
antibody, such as an antibody of the invention as defined herein. 

30 

Those skilled in the art will understand that there are numerous well known clinical 
diagnostic chemistry procedures in which an indicator moiety can be used to form an 
binding reaction product whose amount relates to the amount of the ligand, herein 
Streptococcus pneumoniae or parts thereof, in a sample. Thus, while exemplary as- 
35 say methods are described herein, the invention is not so limited. 
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The present invention also relates to a diagnostic system, preferably in kit form, for 
assaying for the presence, and preferably also the amount, of Streptococcus pneu- 
moniae in a biological sample. Methods for the preparation of diagnostic kits have 
5 e.g. been described in US 5,470,958 and references therein. 

The diagnostic system includes, in an amount sufficient to perform at least 
one assay, an indicator moiety according to the present invention, preferably as a 
separately packaged reagent, and more preferably also instructions for use. Pack- 
aged refers to the use of a solid matrix or material such as glass, plastic (e.g., poly- 

10 ethylene, polypropylene or polycarbonate), paper, foil and the like capable of holding 
within fixed limits an indicator moiety of the present invention. Thus, for example, a 
package can be a glass vial used to contain milligram quantities of a contemplated 
labelled indicator moiety preparation, or it can be a microtiter plate well to which mi- 
crogram quantities of a contemplated indicator moiety has been operatively affixed, 

15 i.e., linked so as to be capable of binding a ligand. 

"Instructions for use" typically include a tangible expression describing the 
reagent concentration or at least one assay method parameter such as the relative 
amounts of reagent and sample to be admixed, maintenance time periods for re- 
agent/sample admixtures, temperature, buffer conditions and the like. 

20 

In most embodiments, the diagnostic method and system of the present invention 
include as a part of the indicator moiety, a label or indicating means capable of signal- 
ling the formation of a binding reaction complex containing an indicator moiety com- 
plexed with the preselected ligand (i.e. a polypeptide comprising any of the se- 

25 quences of SEQ ID NO: 1-282 and/or a fragment thereof). Such labels are themselves 
well-known in clinical diagnostic chemistry. 

The labelling means can be a fluorescent labelling agent that chemically 
binds to antibodies or antigens without denaturing them to form a fluorochrome (dye) 
that is a useful immunofluorescent tracer. Suitable fluorescent labelling agents are 

30 fluorochromes such as fluorescein isocyanate (FIC), fluorescein isothiocyante (FITC), 
5-dimethylamine-1-naphthalenesulfonyl chloride (DANSC), tetramethylrhodamine 
isothiocyanate (TRITC), lissamine, rhodamine 8200 sulphonyl chloride (RB 200 SC). 
Other examples of suitable fluorescent materials include umbelliferone, dichlorotriaz- 
inylamine fluorescein, dansyl chloride or phycoerythrin and the like. A description of 

35 immunofluorescence analysis techniques is found in DeLuca, "Immunofluorescence 
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Analysis", in Antibody As a Tool, Marchalonis, et al., eds., John Wiley & Sons, Ltd., 
pp. 189-231 (1982). 

Radioactive elements can be useful as labelling agents. An exemplary radio- 
labeling agent is a radioactive element that produces gamma ray emissions. Ele- 
5 ments which themselves emit gamma rays, such as 124 1, 125 l, 128 I, 132 l and 51 Cr repre- 
sent one class of gamma ray emission-producing radioactive element indicating 
groups. Particularly preferred is 125 l. Another group of useful labelling means are 
those elements such as 11 C, 18 F, 15 0 and 13 N which themselves emit positrons, or 
beta emitters, such as 111 indium of 3 H. Other suitable radioactive materials include 131 1 
10 and 35 S. 

Detection using antibodies can, in other embodiments, be facilitated by cou- 
pling the antibody to another detectable substance, such as an enzyme, a prosthetic 
group, a luminescent materials, or a bioluminescent material. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, beta-galactosidase, 
15 or acetylcholinesterase; examples of suitable prosthetic group complexes include 
Streptavidin/biotin and avidin/biotin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and ae- 
quorin. 

In preferred embodiments, the indicating group is an enzyme, such as horse- 

20 radish peroxidase (HRP) or glucose oxidase. In such cases where the principal indi- 
cating group is an enzyme such as HRP or glucose oxidase, additional reagents are 
required to visualise the fact that a indicator-moiety/ligand complex (immunoreactant) 
has formed. Such additional reagents for HRP include hydrogen peroxide and an oxi- 
dation dye precursor such as diaminobenzidine. An additional reagent useful with 

25 glucose oxidase is 2,2'-amino-dh(3-ethyl-benzthiazoline-G-sulfonic acid). 

The linking of labels, i.e. labelling of polypeptides such as antibodies, is well 
known in the art. For instance, proteins can be labelled by metabolic incorporation of 
radioisotope-containing amino acids provided as a component in the culture medium. 
See, for example, Galfre et al., Meth. Enzymol., 73:3-46 (1981). The techniques of 

30 protein conjugation or coupling through activated functional groups are particularly 
applicable. See, for example, Aurameas, et al., Scand. J. Immunol., Vol. 8 Suppl. 7:7- 
23 (1978), Rodwell et al. (1984) Biotech. 3:889-894, and U.S. Pat. No. 4,493,795. 

Various diagnostic assays employing the above indicator moieties can be set 
up to test samples for Streptococcus pneumoniae. Exemplary assays are described 

35 in detail in Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring 
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Harbor Laboratory Press, 1988. Representative examples of such assays include: 
countercurrent immuno-electrophoresis (CIEP), radioimmunoassays, radioimmuno- 
precipitations, enzyme-linked immuno-sorbent assays (ELISA), Western blot assays, 
dot blot assays, inhibition or competition assays, and sandwich assays, immunostick 
5 (dipstick) assays, simultaneous immunoassays, immunochromatographic assays, 
immunofiltration assays, latex bead agglutination assays, immunofluorescent assays, 
biosensor assays, and low-light detection assays (see e.g. also U.S. 4,376,110 and 
4,486,530). An example of a suitable assay is an assay wherein a sample, e.g. a se- 
rum sample, is separated by electrophoresis and the polypeptide of interest, e.g. SEQ 

10 ID NO: 16, is subsequently detected by Western blotting. 

In one embodiment, the diagnostic kits of the present invention can be used 
in an "ELISA" format to detect the quantity of a preselected ligand in a fluid sample. 
"ELISA" refers to an enzyme-linked immunosorbent assay that employs an antibody 
or antigen bound to a solid phase and an enzyme-antigen or enzyme-antibody conju- 

15 gate to detect and quantify the amount of an antigen present in a sample and is read- 
ily applicable to the present methods. Thus, in some embodiments, an indicator moi- 
ety of the present invention can be affixed to a solid matrix to form a solid support that 
comprises a package in the subject diagnostic systems. A reagent is typically affixed 
to a solid matrix by adsorption from an aqueous medium although other modes of 

20 affixation applicable to polypeptides, such as antibodies, can be used that are well 
known to those skilled in the art. Useful solid matrices are also well known in the art. 
Such materials are water insoluble and include the cross-linked dextran available un- 
der the trademark SEPHADEX from Pharmacia Fine Chemicals (Piscataway, N.J.); 
agarose; beads of polystyrene beads about 1 micron to about 5 millimetres in diame- 

25 ter available from Abbott Laboratories of North Chicago, III.; polyvinyl chloride, poly- 
styrene, cross-linked polyacrylamide, nitrocellulose- or nylon-based webs such as 
sheets, strips or paddles; or tubes, plates or the wells of a microtiter plate such as 
those made from polystyrene or polyvinylchloride. 

A further diagnostic method may utilise the multivalency of an antibody com- 

30 position of one embodiment of this invention to cross-link ligands, thereby forming an 
aggregation of multiple ligands and polypeptides, producing a precipitable aggregate. 
This embodiment is comparable to the well known methods of immune precipitation. 
This embodiment comprises the steps of admixing a sample with a composition com- 
prising an antibody of this invention to form a binding admixture under binding condi- 

35 tions, followed by a separation step to isolate the formed binding complexes. Typi- 
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cally, isolation is accomplished by centrifugation or filtration to remove the aggregate 
from the admixture. The presence of binding complexes indicates the presence of the 
preselected ligand to be detected. 

Binding partners and inhibitors of polypeptides of the invention 

The surface-localisation of the polypeptides to which this invention relates makes 
them highly suitable as targets for binding partners, such as inhibitors. Surface- 
located polypeptides of a pathogenic microorganism often interact with the host or- 
ganism. Thus, any type of binding partner of a surface-located polypeptide may inter- 
fere with host-pathogen interaction. Binding partners thus often antagonise the 
pathogenicity (virulence) of a microorganism. A binding partner may also be an inhibi- 
tor of the polypeptide it binds. 

Thus, in a further main aspect, the invention relates to methods for the identification 
of binding partners of the surface-located polypeptides set forth in SEQ ID NO: 1-282. 
Such methods may be biochemical or cell-based. 

Biochemical methods 

In a main aspect, the invention relates to a method for identifying a binding partner of 
a polypeptide selected from the group consisting of SEQ ID NO: 1-282, or a fragment 
thereof, comprising the steps of 

a. providing a polypeptide selected from the group consisting of SEQ ID NO:1-282 
preferably selected from the group consisting of SEQ ID NO:16, SEQ ID NO:10, 
SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID 
NO:33, most preferably SEQ ID NO:16, 

or 

a fragment thereof, 

b. contacting said polypeptide or fragment with a putative binding partner, and 

c. determining whether said putative binding partner is capable of binding to said 
polypeptide or fragment. 

In a preferred embodiment, said putative binding partner is a host-derived molecule. 

In further preferred embodiments of the method, the polypeptide or fragment thereof 
is provided immobilised on a solid support, such as e.g. a column or microtiter plate, 
and, after the contacting step, it is determined whether or not the putative binding 



WO 2006/084467 



PCT/DK2006/000073 



62 

partner has bound to the solid support. Immobilisation of the polypeptide or fragment 
thereof may be through direct binding to the solid support, or through indirect binding 
e.g. using a specific antibody. In preferred embodiments, a washing step is performed 
between the contacting step and the determination step, in order to improve the 
5 specificity of detection. In further preferred embodiments, the putative binding partner 
is complexed with a detectable label. The putative partner may be labelled before the 
contacting takes place. Alternatively, labelling may also be performed after the 
contacting step. Furthermore, in some embodiments of this method, immobilisation 
may be performed after the polypeptide or fragment thereof has been bound to the 
10 binding partner. In preferred embodiments, the method is a screening method 
wherein the method is repeated for a plurality of putative binding partners. Suitable 
methods to determine binding are well-known in the art, and several of them have 
been referred to elsewhere herein. 

15 In another aspect, a host-derived binding partner of a polypeptide selected from the 
group of SEQ ID NO:1-282, preferably SEQ ID NO:16 may be identified as follows: 
purified host membranes are electrophoretically separated, blotted over to a 
membrane and incubated with the polypeptide of interest or fragment thereof. Binding 
can then be detected using antibodies specific for the polypeptide of interest or 

20 fragment thereof. The host binding partner to which the polypeptide or fragment 
thereof has bound can subsequently be identified by elution from the blot and 
subsequent analysis by mass spectrometry, or by any other technique known in the 
art. 

If the binding partner of a surface-located polypeptide of a pathogenic 
25 organism is a host-derived molecule, then such an interaction between the surface- 
located polypeptide and the host may be important for the virulence of the bacterium. 
Compounds that interfere with the interaction of the surface-located polypeptide and 
the host-derived binding partner may thus be suitable for prevention or treatment of 
Streptococcus pneumoniae infections. Accordingly, another method of the invention 
30 relates to a method of identifying an inhibitor of the interaction of any of the surface- 
located Streptococcus pneumoniae polypeptides of SEQ ID NO: 1-282 with a host- 
derived binding partner comprising the steps of: 

a. providing any of the polypeptides of SEQ ID NO: 1-282, preferably selected from 
the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID 
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NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably SEQ 

ID NO:16 

or 

a fragment thereof, 

b. providing a host-derived binding partner of said polypeptide (identified as 
described above or by any other method), 

c. contacting said polypeptide with said host-derived binding partner in the absence 
of a putative inhibitor of said interaction, 

d. contacting said polypeptide with said host-derived binding partner in the presence 
of said putative inhibitor, 

and 

e. determining whether the strength of the binding of said polypeptide to said host- 
derived binding partner resulting from step d. is reduced as compared to that 
resulting from step c. 

In some embodiments, step c. and d. may be performed in two different sample 
compartments. In other embodiments, step d. may be performed by adding the 
putative inhibitor to the mixture of step c. In preferred embodiments, the method is 
repeated for a plurality of putative inhibitors. 

Of particular interest are binding partners that inhibit an activity of a surface-located 
polypeptide. Such activity may be enzymatic activity, transport activity, or any type of 
other biochemical or cellular activity, preferably enzymatic activity. 

Preferred host-derived binding partners are host polypeptides and host lipids. 
Binding may e.g. be determined as described by Szymanski and Armstrong (1996) 
Infect. Immun. 64:3467-3474. 

In preferred embodiments of the above described biochemical methods, the 
binding between the binding partner and the surface-located polypeptide or fragment 
thereof has a dissociation constant or Kd of less than 5 X 10" 6 M, such as less than 10" 
6 M, e.g. less than 5 X 10' 7 M, such as less than 10" 7 M, e.g. less than 5 X 10" 8 M, such 
as less than 10" 8 M, e.g. less than 5 X 10' 9 M, such as less than 10" 9 M, e.g. less than 5 
X 10' 10 M, such as less than 10' 10 M, e.g. less than 5 X 1CT 11 M, such as less than 10" 
11 M, e.g. less than 5 X 10" 12 M, such as less than 10" 12 M. Dissociation constants can 
e.g. be determined by surface plasmon resonance analysis. 

Cell-based methods 
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Reducing the level of a surface-located polypeptide, by deletion or disruption of the 
structural gene for it or by down-regulating gene expression (see below), may affect a 
bacterial cell. The cell may become more sensitive to cytotoxic compounds. 
Especially for surface-located polypeptides, a reduction of their level may affect the 
5 function of the cell's exterior parts, such as the membrane or cell wall, in preventing 
compounds of entering the cell. Thus, reduction of the level of an surface-located 
polypeptide can make a cell more 'permeable' for various compounds. 

Thus, an aspect of the present invention relates to a method for identifying a 
10 compound with antibacterial activity against Streptococcus pneumoniae comprising 
the steps of 

a. providing a sensitised cell which has a reduced level of any of the polypeptides of 
SEQ ID NO: 1-282, preferably selected from the group consisting of SEQ ID 
NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID 

15 NO:28 and SEQ ID NO:33, most preferably SEQ ID NO:16 

and 

b. determining the sensitivity of said cell to a putative antibacterial compound, for 
instance by a growth assay. 

Preferably, the method is a screening method wherein the procedure is repeated for a 
20 plurality of putative antibacterial compounds. Preferred putative antibacterial 
compounds are ones that do not pass through the membrane of a wild-type 
Streptococcus pneumoniae cell. 

The rationale behind this approach is that a cell with a lower level of the surface- 
25 located polypeptide will exhibit increased sensitivity to cytotoxic compounds, allowing 
identification of antibacterial compounds with low potency that are missed when using 
wild-type cells for the assay. Compounds identified by this method will be often need 
to be modified in order to improve potency. This can be done by chemical 
modification. 

30 

Inhibition of the activity of a surface-located polypeptide may affect the viability (i.e. 
survival, growth and/or proliferation) of the bacterium. Of particular interest is 
inhibition of surface-located polypeptides that are essential for viability of 
Streptococcus pneumoniae. Methods for testing essentiality of a Streptococcus 
35 pneumoniae gene have been described in the prior art, e.g. in Chan et al. (2002) J. 
Bacteriol 185:2051-2058 and Thanassi et al. (2002) Nucleic Acid Res. 30:3152-31-62. 
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Inhibitors of essential surface-located polypeptides may not need to enter the 
bacterial cell to be able to affect its viability. Thus, generally fewer requirements are 
posed on the structure of an inhibitor of an essential surface-located target 
polypeptide than on an inhibitor of an intracellular target, to be effective as an 
antibacterial agent. 

Accordingly, the invention relates to a method for identifying an inhibitor of a 
polypeptide selected from the group consisting of SEQ ID NO:1-282, comprising the 
steps of 

a. providing two cells which differ in the level of any of the polypeptides of SEQ ID 
NO: 1-282, preferably selected from the group consisting of SEQ ID NO: 16, SEQ 
ID NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and 
SEQ ID NO:33, most preferably SEQ ID NO:16 

b. determining the sensitivity of said cells to a putative inhibitor, for instance by a 
growth assay, and 

c. determining whether said two cells are differently affected by the presence of said 
putative inhibitor. 

Preferably, the method is repeated for a plurality of putative inhibitors. Preferred 
inhibitors are ones that do not pass through the membrane of a Streptococcus 
pneumoniae cell. 

The rationale behind this approach is that the viability of a cell with a lower activity of 
the essential polypeptide will be more affected by an inhibitor of the polypeptide than 
the viability of the cell with a higher level. If the two cells are differently affected, this is 
an indication that the inhibitor acts on the target or at least in the same biochemical 
pathway. 

In some embodiments of the method, the two cells with different activity of the 
polypeptide of interest are a wild-type cell (or other cell with wild-type activity of the 
gene of interest) and a sensitised cell with a reduced activity of the polypeptide of 
interest. In some embodiments, the different or reduced level in the sensitised cell 
can be a different or reduced expression level of the gene of interest (resulting in a 
different or reduced copy number of the polypeptide). This can be accomplished by 
putting the gene under control of a regulatable promoter or by regulatable expression 
of an antisense RNA which inhibits translation of an mRNA encoding the essential 
polypeptide. In other embodiments, the different or reduced activity can be a different 
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or reduced activity of the polypeptide of interest, e.g. due to a mutation, such as a 
temperature-sensitive mutation. 

Suitable ways of generating sensitised bacterial cells and of using these in screening 
for inhibitors have been described in WO 02/077183. Sensitised cells may be 
obtained by growing a conditional-expression Streptococcus pneumoniae mutant 
strain in the presence of a concentration of inducer or repressor or other conditions 
which provide a level of a gene product required for bacterial viability such that the 
presence or absence of its function becomes a rate-determining step for viability. The 
sub-lethal expression of the target gene may be such that growth inhibition is at least 
about 10%, such as at least about 25%, e.g. at least about 50%, such as at least 
about 75%, e.g. at least 90%, such as at least 95%. 

In another embodiment of the cell-based assays of the present invention, sensitised 
cells are obtained by reduction of the level activity of a polypeptide required for 
bacterial viability using a mutation, such as a temperature-sensitive mutation, in the 
polypeptide. Growing such cells at an intermediate temperature between the 
permissive and restrictive temperatures produces cells with reduced activity of the 
gene product. It will be appreciated that the above method may be performed with 
any mutation which reduces but does not eliminate the activity or level of the gene 
product which is required for bacterial viability. This approach may also be combined 
with the conditional-expression approach. In this combined approach, cells are 
created in which there is a temperature-sensitive mutation in the gene of interest and 
in which this gene is also conditionally-expressed. 

When screening for inhibitors of an essential polypeptide, growth inhibition can be 
measured by directly comparing the amount of growth, measured by the optical 
density of the culture relative to uninoculated growth medium, in an experimental 
sample with that of a control sample. Alternative methods for assaying cell 
proliferation include measuring green fluorescent protein (GFP) reporter construct 
emissions, various enzymatic activity assays, and other methods well known in the 
art. Other parameters used to measure viability include e.g. colony forming units. The 
above method may be performed in solid phase, liquid phase, a combination of the 
two preceding media, or in vivo. Multiple compounds may be transferred to agar 
plates and simultaneously tested using automated and semi-automated equipment. 
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Cell-based assays of the present invention are capable of detecting compounds 
exhibiting low or moderate potency against the target molecule of interest because 
such compounds are substantially more potent on sensitised cells than on non- 
sensitised cells. The effect may be such that a test compound may be two to several 
times more potent, e.g. at least 10 times more potent, such as at least 20 times more 
potent, e.g. at least 50 times more potent, such as at least 100 times more potent, 
e.g. at least 1000 times more potent, or even more than 1000 times more potent 
when tested on the sensitised cells as compared to non-sensitised cells. 

A mutant Streptococcus pneumoniae strain that overexpresses a surface-located 
polypeptide can also be used to identify a compound that inhibits such a polypeptide. 
If the compound is cytotoxic, overexpression of the target polypeptide can make cells 
more resistant. Thus, the invention also relates to a method for finding an inhibitor of 
any of the surface-located Streptococcus pneumoniae polypeptides of SEQ ID NO:1- 
282 comprising the steps of 

a. providing two cells which differ in the activity of any of the surface-located 
Streptococcus pneumoniae polypeptides of SEQ ID NO: 1-282, preferably 
selected from the group consisting of SEQ ID NO: 16, SEQ ID NO:10, SEQ ID 
NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most 
preferably SEQ ID NO: 16 

wherein one cell contains a substantially wild-type copy number of said 
polypeptide and the other cell contains higher than wild-type copy number of said 
polypeptide, 

b. determining the sensitivity of said cells to a putative inhibitor, for instance by a 
growth assay, and 

c. determining whether or not said two cells are differently affected by the presence 
of said putative inhibitor. 

Overexpression may be achieved using strong promoters or by introducing multiple 
copies of the structural gene for a surface-located polypeptide. As also 
overexpression of polypeptides that are not the cellular target of an inhibitor can make 
cells resistance to an inhibitor, inhibition of the target polypeptide of interest by a 
putative inhibitor will need to be verified by other means, such as e.g. a biochemical 
assay. 
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In addition to inhibitors of a biochemical or other cellular activity of a surface-located 
polypeptide, the cellular methods described above can be used to identify compounds 
that reduce the expression level of a target, and thereby its copy number, e.g. by 
interfering with gene regulation. 

5 

In preferred embodiments of the any of the cell-based- or biochemical methods for 
finding binding partners or inhibitors, the method is repeated for a plurality of 
candidate compounds. 

10 In a further aspect, the invention relates to the mutant Streptococcus pneumoniae 
strains used in the cell-based methods described herein, such as strains in which the 
gene encoding the surface-located polypeptide is placed under the control of a 
heterologous regulatable promoter, strains carrying temperature-sensitive alleles of 
the surface-located polypeptides, and strains overexpressing the surface-located 

15 polypeptides. 

Other methods of interfering with bacterial growth by targeting surface-located 
polypeptides, such as any of the polypeptides of SEQ ID NO: 1-282 include 
suppression of gene expression using specific antisense molecules, such antisense 
20 RNA or DNA, and using ribozyme molecules specific for mRNA encoding the 
essential surface-located polypeptides. 
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Example 1 
Strategy: 

The experimental steps in the project are as follows: Surface proteins were isolated 
5 by either high pH elution or by mutanolysin digestion. Isolated of surface proteins 
were identified by three complementary mass spectrometric based strategies: 1) 2-D 
SDS PAGE, 2) 1D SDS PAGE and 3) ln-solution digest. All three strategies include 
protein identification by mass spectrometry analysis. The surface identified proteins 
are cloned into an E. coli expression vector. The expressed recombinant proteins are 
10 purified and used to immunise mice to verify immunogenicity of the antigens. The 
immunised mice are used in challenge studies in which the mice are challenged with 
Streptococcus pneumoniae and protection against disease and/or death is monitored. 

Mice: 

15 Six-weeks-old BALB/c mice were housed under specific-pathogen-free conditions and 
given sterile food and water ad libitum. 

Bacteria: 

Escherichia coli Top10 (Invitrogen) was used as the host for routine plasmid cloning. 

20 Recombinant proteins were expressed in E. coli BL21/(DE3) (Invitrogen.). E. coli were 
cultured in Luria broth supplemented with antibiotics. Virulent S. pneumoniae strain 
D39 (serotype 2, purchased by Dr. M. Trombe, CICT, Toulouse, France) was used for 
proteomics, challenge experiments and as a source of genomic DNA for PGR amplifi- 
cation experiments. Clinical isolates of S. pneumoniae, including 40 serotypes re- 

25 sponsible for the majority of pneumococcal infections were selected and purchased 
from the WHO Collaborating Centre for Reference and Research on Pneumococci in 
Copenhagen, Denmark. S. pneumoniae were routinely grown on blood agar plates 
(Difco). 

30 Isolation of S. pneumoniae cell envelope fraction: 

Mutanolysin digestion of pneumococcal cell walL Bacteria were grown overnight on 
blood agar plates, harvested into phosphate-buffered saline (PBS) containing 20% 
sucrose and pelleted by centrifugation at 6000g for 10 minutes. The pellet was resus- 
pended in 0.5 ml of osmotic digestion buffer (20% sucrose in 20 mM Tris-HCI, pH 7.0, 
35 10 mM MgCI 2 , protease inhibitor cocktail and 100 U/ml mutanolysin (Sigma) per plate. 
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Enzymatic digestion was allowed to proceed for 1-2 h at 37°C. The intact protoplasts 
were removed by centrifugation at 7,000 x g for 15 min. The supernatant was col- 
lected, acetone precipitated and analysed using mass spectrometry based tech- 
niques. 

5 High pH elution of surface proteins. Bacteria were grown overnight on blood agar 
plates, harvested into PBS containing 20% sucrose and pelleted by centrifugation at 
6000g for 10 minutes. The pellet was resuspended in PBS containing 20% sucrose 
and centrifuged again as above. Then bacteria were resuspended in 2 ml of 50 mM 
glycine-NaOH (pH 12) containing 20% sucrose per plate. Alkali extraction of cell sur- 
10 face proteins was allowed to proceed for 30 minutes at room temperature with gentle 
shaking. The suspension was centrifuged at 15,000gfor 20 min, the supernatant was 
collected, adjusted to pH 7 with 1 M HCI, acetone precipitated and analysed by 1-D 
and 2-D gel electrophoresis. 

15 Surface Protein Identification: 

The complex mixture of proteins obtained after surface extraction was analysed by 
three complementary strategies all based on mass spectrometry: 2D gel based strat- 
egy, 1 D gel based strategy and In-solution digest strategy. 

20 2-D gel based strategy (2D-gel MALDI- TOF/T OF) : Two-dimensional gel electrophore- 
sis was performed either on the Ettan Dalt 2 system (Amersham Biosciences) or on 
the Novex NuPage system (Invitrogen) according to the manual provided with the gel 
system. In brief: First dimension runs were performed on either 7 cm or 24 cm pre- 
cast IPG strips (pH range 3-10 or 4-7) using the Ettan IPGphor isoelectric focusing 

25 system (Amersham Biosciences) according to the manufacturer's instructions. Isofo- 
cusing was performed at the following conditions: 7 cm strips: 8000 Vh, 24 cm strips: 
52000Vh. The second dimension was performed using pre-cast 12.5 % gels (Amer- 
sham Biosciences) at 5W per gel for 15 min then total 170 W for 4-6 hours for 24 cm 
strips. The 7 cm strips were run on the Novex NuPage system (Invitrogen) using pre- 

30 cast 4-12% gels (Invitrogen) at 200 volts for 40 minutes. Gels were silver stained ac- 
cording to a modified method described originally by Mortz et al. (2001) Proteomics 
1(1 1), 1359-1363, and spots for mass spec analysis were picked using the Ettan Spot 
Picker from Amersham according to the manufacturer instructions. 

Specific protein spots were spot-picked, and placed in Milli-Q water. These 

35 gel plugs were washed in 50mM NH4HCO3 / 50% ethanol and dehydrated by incuba- 
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tion in 96% ethanol. Reduction and alkylation was performed by incubating in reduc- 
ing solution (10 mM DTT, 50 mM NH4HCO3) at 56°C followed by a room temperature 
incubation in alkylation solution (55 mM iodoacetamide, 50 mM NH4HCO3) in the 
dark. Two cycles of washing and dehydration were then performed prior to the addi- 
5 tion of 5 ul trypsin solution (12.5 ng/ul Promega trypsin in 50 mM NH4HCO3, 10% 
Acetonitrile). Then an additional amount of sodium bicarbonate solution was added 
and the digests were incubated overnight at 37°C. Trifluoroacetic acid was added to 
the overnight digest followed by incubation with shaking. 

Parts of the extract were used in MALDI-TOF peptide mass fingerprint and 

10 MALDI-TOF/TOF analysis (Ultraflex, Bruker Daltonics, Germany) and the peak-lists 
were used in database searching against a specific S. pneumoniae database. The 
Mascot search program and scoring algorithm (Matrix Science, UK) was used in da- 
tabase searching. Peptide mass tolerance was set to 60 ppm and 0.7 Da, respec- 
tively. Search parameters were adjusted to include oxidation of Met, the addition of 

15 Carbamidomethyl groups to Cys, and trypsin was allowed to miss one cleavage site 
per peptide. 

1D gel based strategy (GeLC-MS/MS): One-dimensional gel electrophoresis was per- 
formed on a Novex NuPage system (Invitrogen) according to the manual provided 

20 with the gel system. In brief we used, size 8 cm x 8 cm, 1 mm thick pre-cast 12% bis- 
tris gels (Invitrogen) at 200 volts for 60 minutes in NuPage-MOPS-SDS running 
buffer. Gels were silver stained according to a modified method described originally 
by Mortz et al. (2001) Proteomics 1(11), 1359-1363. Whole lanes were cut out with 
razorblade in 0,5 cm gel slices. The gel slices were digested as described under 2D 

25 gel bases strategy, but the amount of trypsin was 20 ul trypsin solution (12.5 ng/ul 
Promega trypsin in 50 mM NH4HCO3, 10% Acetonitrile). 

The extracts was analysed on a LC-MS/MS (Waters Cap-LC and Micromass 
Ultima TOF MS). Each extract was submitted to a 115 minutes LC-MS/MS analysis. 
The peak lists generated from the fragmented peptides were used in database 

30 searching against a specific Streptococcus pneumoniae database. The Mascot 
search program and scoring algorithm (Matrix Science, UK) was used in database 
searching. Peptide mass tolerance was set to 200 ppm and 0.4 Da for fragment ions. 
Search parameters were adjusted to include oxidation of Met, the addition of Car- 
bamidomethyl groups to Cys, and trypsin was allowed to miss one cleavage site per 

35 peptide. 
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ln-solution based strategy (ISD-MS/MS): The protein mixture was resuspended in 50 
mM NH4HCO3, 10% Acetonitrile. Trypsin solution (50 |nl) (12.5 ng/pl Promega trypsin 
in 50 mM NH4HCO3, 10% Acetonitrile) was added and the mixture was incubated 
5 overnight at 37°C. The digestion was stopped by acidification with TFA (final cone. 
1%) and analysed by LC-MS/MS as described under ID-based strategy. Database 
search was also performed as described under 1 D-based strategy. 

Detection of genes for protein vaccine candidates in different S. pneumoniae 
10 strains: 

PCR amplification was used to demonstrate the presence of genes encoding antigens 
listed in clinical isolates of S. pneumoniae. For this purpose cells were grown on 
blood agar and diluted in PBS. Genomic DNAs were prepared from 40 pneumococcal 
strains by heating (95°C for 5 min) and aliquots were used as templates for PCR am- 
15 plification with Taq polymerase (Qiagen) with gene specific primers. Amplification 
products were electrophoresed through 1% agarose gels and visualized by staining 
with ethidium bromide (0.5 pg/nil). 

Reverse Transcription Polymerase chain reaction (RT-PCR): 

20 A BALB/C mouse was infected with S. pneumoniae D39 as described below under 
Pneumococcal challenge. After 1 day of infection this mouse was sacrificed and the 
spleen was extracted and divided in two pieces. For isolation of intact total bacteria 
RNA from tissue, one half the organ was frozen quickly in liquid nitrogen and stored 
at - 80° C before RNA isolation. The other half of each organ was tested for bacteria 

25 using blood agar plates before RNA isolation (data not shown). Total RNA was iso- 
lated from animal tissue containing S. pneumoniae with the Rneasy Kit (Qiagen, Hil- 
den). First-strand cDNA synthesis was performed with total RNA and the iScript Kit 
(Biorad). 1 pi cDNA were used for the subsequent PCR-step with gene specific prim- 
ers. 

30 

Production of recombinant vaccines (rec. vac): 

The production of recombinant vaccine was achieved by PCR amplification of pneu- 
mococcal genes, with subsequent cloning and expression of the genes in E. coli. Oli- 
gonucleotide primers used in PCR amplification experiments were all purchased from 
35 MWG, Germany. Pneumococcal genes used for protein expression were amplified 
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from genomic DNA of S. pneumoniae strain D39 by using the high-fidelity thermosta- 
ble DNA polymerase, Platinum Pfx (Life Technologies). The coding sequence was 
amplified with primers containing the start-codon but excluding the stop-codon of the 
open reading frame. The coding sequences used for protein expression were cloned 
5 into plasmid pET!01 (invitrogen) using directional Topo cloning kit, with E. coli Top10 
as the bacterial host. A plasmid-encoded C-terminal polyhistidine tag flanks each re- 
combinant protein. For recombinant protein expression, each recombinant pET!01 
plasmid was subcloned into the E. coli strain BL21 (DE3). Recombinant protein ex- 
pression was initiated by induction with IPTG (isopropyl-fc-D-thiogalactopyranoside), 

10 and proteins were purified from the soluble fraction of recombinant E. coli lysates by 
using metal affinity chromatography resin and buffers (invitrogen), according to the 
manufacturer's instructions. Protein concentrations were estimated by using the BCA 
test (Hercules, Calif.). The recombinant proteins were dialyzed against PBS (Millipore) 
and stored at ~80°C. Recombinant Proteins were identified with MALDI MS technol- 

1 5 ogy after purification. 

Western Blot analysis: 

Purified proteins were separated on one-dimensional (ca. 20pg protein) SDS-PAGE 
and transferred to PVDF membranes. The immunological detection of immobilized 
20 proteins was performed after manufactures instructions (invitrogen). Patient sera 
were kindly provided by Dr. M. Trombe (CICT, Toulouse, France). We used a dilution 
of 1:100 with single and pooled patient sera for immunodetection. The dilution of the 
secondary antibody was 1 :5000. 

25 Pneumococcal challenge of actively immunized mice: 

BALB/C mice were used in challenge experiments (10 mice per group). For an anti- 
gen specific vaccination mice were primed with 25pg of each antigen containing the 
ALUM adjuvant (100pg) on day zero. Animals were boosted with the same antigen 
concentration on day 14 and day 28. We used untreated BALB/c mice and mice 

30 treated with an unrelated cellulose-binding domain CBD (sigma) as negative controls. 
All vaccines were administered subcutaneously (s.c). All mice were bled on days 0, 
21 and 36 and challenged on day 35 with S. pneumoniae D39. Individual sera from 
each immunized mouse were tested for the presence of specific antibodies prior to 
challenge. Virulent S. pneumoniae (D39) grown on blood agar plates was prepared 

35 for challenge via the intra peritoneal (i.p.) route in actively immunized mice or control 
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groups. For challenge infections, mice were injected i.p. with approximately 10 7 CFU 
per mouse of virulent S. pneumoniae strain D39 suspended in PBS. The actual num- 
ber of CFU administered was determined retrospectively by plating serial dilutions of 
the inocula on blood agar. The survival of mice was monitored for 7 days, at which 
5 time the experiments were terminated. 



ELISA for detection of immunoglobulin G (IgG) mouse: 

Elisa assays were developed for detection of antigen specific IgG in mouse sera at 
day 0, day 21 and day 35 after vaccination. Different ELISA plates were coated with 

10 recombinant vaccines (2 |jg) and whole cell lysates (2 jjg) of S. pneumoniae D39. 
Two fold serial dilutions were made from mouse sera as a primary antibody. The dilu- 
tion of the secondary antibody (goat anti mouse IgG, horse radish peroxidase conju- 
gated) was 1:5000. OPD substrate was used for the color development. Plates were 
read at 492 nm. Results of IgG status at day 21 and day 35 after vaccination were 

15 compared with IgG status at day 0 of vaccination. 



Results: 

We identified 282 different polypeptides in mutanolysin created cell-surface fractions 
of S. pneumoniae. Sequences of the identified polypeptides are given in figure 2. The 

20 methods that were employed identifies polypeptides that are expressed at a relatively 
high level. We used three different strategies for the detection of isolated S. pneumo- 
niae surface proteins: a) a 2-D gel based strategy, b) a strategy with 1-D gels and 
LCJVIS/MS and c) an in-solution based MS strategy. We selected ORFs of identified 
surface proteins for characterization. ORFs of genes were amplified with PCR and 

25 cloned directional into pET101. Recombinant proteins were expressed in E. coli as 
described in materials and methods. Four recombinant proteins were selected for 
further studies (table 3): 

_ „ AnrP number _ . t . 

Group# (antigen# , shorfc#) Descrxption 

1 23 0653 (1, 653) hydrolase, putative [Streptococcus pneu- 

moniae TI; ca. 32 kDa (rec. vac.) 

2 516029 (2, 029) lipoate-protein ligase, putative [Strep- 

tococcus pneumoniae TIGR4] ; ca. 4 0 kDa 
(rec . vac . ) 

3 800607 (3, 607) ATP-dependent Clp protease, proteolytic 

subunit; ca. 24 kDa (rec. vac.) 

4 944060 (4, 060) autoinducer-2 production protein; 

21 kDa (rec. vac.) 
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A PCR investigation demonstrated that genes of each of these 4 selected proteins 
are presented in 40 different serotypes of S. pneumoniae (table 4). 

Table 4: PCR with different S. pneumoniae serotypes and specific primers for 
following genes (+ = detected; - = not detected) 
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For identification of transcripts of these 4 genes we made a RT-PCR analysis. A 
BALB/C mouse was infected with S. pneumoniae and total RNA was isolated from 
spleen after day 1 of infection. RT-PCRs with RNAs from spleen demonstrated that 
the selected genes are expressed in S. pneumoniae after animal infection (figure 3). 
5 In addition Western Blots (WB) with patient sera were made for testing the immuno- 
genicity of proteins listed in table 3. Furthermore 3 additional proteins (144, 
AnrP454144 (14 kDa rec. vac); 487, AnrP98487 (32 kDa); 646; AnrP373646 (25 
kDa)) were also tested for immunogenicity with Western Blot. We detected recombi- 
nant vaccines 029, 060, 607, 646 and 653 in immunoblots (WBs) with sera isolated 
10 from a single patient (figure 4) or pooled from different patients (data not shown). In 
WBs unspecific signals or no signals were detected for purified proteins 144 and 487, 
however, lack of a signal does not exclude that these two proteins may be suitable as 
vaccines. 

15 We vaccinated mice with proteins in table 3 as a protein vaccine and tested the pro- 
tection efficiency against S. pneumoniae. Antigens were prepared with alum and in- 
jected subcutaneously into BALB/C mice at three time points (day 0, 14, 28). For 
negative controls, mice were left untreated (group 5) or treated with lOOpg alum 
(group 6) or with an unrelated protein (group 7). We tested the immunogenicity of 

20 each protein at day 0, day 21 and day 35 of vaccination with an ELISA assay. Mice 
produced immunoglobulin (IgG) against antigens 029, 060, 607, and 653 (figure 5). 
No immune response against pneumococcal antigens was detected as expected in 
animals of groups 5,6 and 7. For the bacterial challenge, each mouse was infected 
with S. pneumoniae D39 (10 7 CFU per mouse) at day 35 of vaccination. Mice derived 

25 from two groups, vaccinated with proteins 029 and 607 demonstrated a lower mortal- 
ity and a lower CFU-titre after infection with this S. pneumoniae strain (figure 6, figure 
7). Proteins 060 and 653 also showed a trend towards lower CFU-titre (figure 6). 

In order to investigate variations in sequence between different strains of Streptococ- 
30 cus pneumoniae, the sequences of antigens 029 and 607 were partially determined 
from serotypes 15b, 15c, and 35f, and from Streptococcus pneumoniae strain D39. 
These sequences were compared with database sequences of 029 and 607 from 
type 4 (TIGR) and R6 (Sanger Center). The region of 029 from amino acid position 1 
to amino acid position 315 showed more than 98% sequence identity on amino acid 
35 level between the six strains. For 607, more than 98% sequence identity on amino 
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acid level between the six strains was found in the region from position 20 to position 
190. These data indicate that 029 and 607 are well-conserved across different 
strains. 

5 The structure of antigen 029 (SEQ ID NO: 16) (putative lipoate protein ligase) has 
been determined and is accessible under accession number 1VQZ in the PDB (Pro- 
tein Data Bank) and HSSP databases of EBI, the European Bioinformatics Institute. 
Surface-exposed regions were predicted by identification of amino acids with high 
water accessibility (ACC), which at the same time should have a low variability (VAR) 
10 in sequence (see table below). The amino acid stretches of different areas separated 
by several amino acids are adjacent in the 3D structure and are therefore paired to- 
gether. 

AMINO ACIDS exposed on the surface: 
15 155-160 + 185-191 [DLSVLA (SEQ ID NO:289) / IINELPK (SEQ ID NO:290)] 
127-128 + 166-180 [IDG / SKDKFESKGVKSVRA (SEQ ID NO:291)] 
195-204 + 207-211 [VEKFRDLLLE (SEQ ID NO:292) / KKEYP (SEQ ID NO:293)] 



Amino acid 
No. 


Amino 
acid 


ACC 


VAR 


127 


I 


7 


21 


128 


D 


139 


26 


129 


G 


36 


18 










155 


D 


78 


20 


156 


L 


90 


24 


157 


S 


72 


35 


158 


V 


9 


49 


159 


L 


53 


14 


160 


A 


83 


42 










166 


S 


15 


37 


167 


K 


102 


35 


168 


D 


59 


37 


169 


K 


74 


7 


170 


F 


34 


22 
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171 


E 


92 


41 


172 


S 


86 


26 


173 


K 


39 


13 


174 


G 


64 


10 


175 


V 


78 


11 


176 


K 


96 


28 


177 


S 


77 


0 


178 


V 


100 


11 


179 


R 


102 


20 


180 


A 


86 


32 










185 


I 


0 


14 


186 


I 


4.4 


48 


187 


N 


110 


32 


188 


E 


51 


46 


189 


L 


18 


18 


190 


P 


102 


42 


191 


K 


70 


36 










195 


V 


1 


43 


196 


E 


84 


26 


197 


K 


112 


39 


198 


F 


0 


22 


199 


R 


37 


43 


200 


D 


72 


40 


201 


■ 

L 


33 


42 


202 


■ 

L 


0 


25 


203 


L 


24 


38 


204 


E 


116 


46 










207 


K 


65 


51 


208 


K 


102 


41 


209 


E 


81 


50 


210 


Y 


42 


50 


211 


P 


107 


46 
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Claims 

1 . A composition comprising 

- an antibody capable of binding the polypeptide of SEQ ID NO:20 or capable of 
binding a polypeptide selected from the group consisting of SEQ ID NO:1-19 
or from the group consisting of SEQ ID NO:21-282, or 

- a polypeptide which comprises a sequence selected from the group consisting 
of the polypeptides of SEQ ID NO:1-19 or from the group consisting of SEQ ID 
NO:21-282, or comprises an antigenic fragment or variant of said sequence, 
or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 
for use as a medicament. 

2. The composition of claim 1 , comprising or further comprising 

- a polypeptide which comprises SEQ ID NO: 16, or comprises an antigenic 
fragment or variant of SEQ ID NO: 16, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

an antibody capable of binding said polypeptide. 

3. The composition of any of the preceding claims, comprising or further comprising 

- a polypeptide which comprises SEQ ID NO: 10, or comprises an antigenic 
fragment or variant of SEQ ID NO: 10, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

an antibody capable of binding said polypeptide. 

4. The composition of any of the preceding claims, comprising or further comprising 
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a polypeptide which comprises SEQ ID NO: 13, or comprises an antigenic 
fragment or variant of SEQ ID NO: 13, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

an antibody capable of binding said polypeptide. 

5. The composition of any of the preceding claims, comprising or further comprising 

- a polypeptide which comprises SEQ ID NO:28, or comprises an antigenic 
fragment or variant of SEQ ID NO:28, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

an antibody capable of binding said polypeptide. 

6. The composition of any of the claims 2 to 5, further comprising 

- a polypeptide which comprises SEQ ID NO:20, or comprises an antigenic 
fragment or variant of SEQ ID NO:20, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, or 

an antibody capable of binding said polypeptide. 

7. The composition of any of the preceding claims, comprising or further comprising 
an antibody capable of binding the polypeptide of SEQ ID NO:26. 

8. The composition of any of the preceding claims, comprising or further comprising 
an antibody capable of binding the polypeptide of SEQ ID NO:33. 

9. The composition of any of the preceding claims, wherein the variant has at least 
95%, such as at least 96%, e.g. at least 97%, such as at least 98%, e.g. at least 
99% sequence identity to said sequence. 
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10. The composition of any of the preceding claims, wherein the antigenic fragment 
comprises less than 99%, such as less than 75%, e.g. less than 50%, such as 
less than 25%, e.g. less than 20%, such as less than 15%, or e.g. less than 10% 
of the full-length of said sequence. 

1 1 . The composition of any of the preceding claims, wherein the antigenic fragment 
comprises less than 70 consecutive amino acid residues, e.g. less than 50, such 
as less than 40, e.g. less than 30, such as less than consecutive 20 residues of 
said sequence. 

12. The composition of any of the preceding claims, wherein the antigenic fragment 
comprises 6 or more, such as 7 or more, e.g. 8 or more, such as 9 or more, e.g. 
10 or more consecutive amino acids of said sequence. 

13. The composition of any of the preceding claims, wherein the composition 
comprises two or more of the polypeptides of SEQ ID NO: 1-282, preferably two or 
more of the polypeptides of SEQ ID NO:1-41, such as any of the compositions of 
figure 1. 

14. The composition of any of the preceding claims, wherein the polypeptide 
comprises a tag, such as a histidine tag. 

15. The composition of any of the preceding claims, wherein the recombinant cell is 
an attenuated or reduced-virulence Escherichia coli cell or an attenuated or 
reduced-virulence Salmonella cell. 

16. The composition of any of the preceding claims, wherein the recombinant cell is 
alive. 

17. The composition of any of the preceding claims, wherein the recombinant cell is 
dead. 

18. The composition of any of the preceding claims, wherein the medicament is a 
vaccine. 
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19. The composition of any of the preceding claims, wherein the composition 
comprises an immunogenic carrier, such as a carrier protein, wherein the 
immunogenic carrier preferably is bound to said polypeptide. 

20. The composition of any of the preceding claims, wherein the composition 
comprises an adjuvant. 

21. The composition of any of the preceding claims, wherein the antibody furthermore 
is capable of binding an intact Streptococcus pneumoniae cell. 

22. The composition of any of the preceding claims, wherein the antibody is 
polyclonal. 

23. The composition of any of the preceding claims, wherein the antibody is 
monoclonal. 

24. The composition of any of the preceding claims, wherein the antibody is a human 
antibody or humanised antibody. 

25. The composition of any of the preceding claims, wherein the antibody is a binding 
fragment of an antibody. 

26. The composition of any of the preceding claims, wherein the antibody has a 
dissociation constant or Kd less than 5 X 10" 6 M, such as less than 10" 6 M, e.g. less 
than 5 X 10" 7 M, such as less than 10 7 M, e.g. less than 5 X 10" 8 M, such as less 
than 10" 8 M, e.g. less than 5 X 10" 9 M, such as less than 10" 9 M, e.g. less than 5 X 
10" 10 M, such as less than 10" 10 M, e.g. less than 5 X 1(T 11 M, such as less than 10" 
11 M, e.g. less than 5 X 10" 12 M, such as less than 10 _12 M, e.g. less than 5 X 10" 13 M, 
such as less than 10" 13 M, e.g. less than 5 X 10" 14 M, such as less than 10" 14 M, e.g. 
less than 5 X 10 _15 M, or less than 10~ 15 M. 

27. The composition of any of the preceding claims, wherein the composition 
comprises a pharmaceutically-acceptable carrier. 
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28. The composition of any of the preceding claims, wherein the composition is 
suitable for systemic administration, 

29. The composition of any of the preceding claims, wherein the composition is 
5 suitable for intravenous, intramuscular, or subcutaneous administration. 

30. The composition of any of the preceding claims, wherein the composition is 
suitable for oral administration. 

10 31. The composition of any of the preceding claims, wherein the composition is 

suitable for intranasal administration. 

32. An antibody capable of binding a polypeptide selected from the group consisting 
of SEQ ID NO:1-282, preferably selected from the group consisting of SEQ ID 

15 NO:1-41, more preferably selected from the group consisting of SEQ ID NO:20, 

SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:26, SEQ ID NO:28 
and SEQ ID NO:33, most preferably the polypeptide of SEQ ID NO:16. 

33. The antibody of claim 32, wherein the antibody furthermore is capable of binding 
20 an intact Streptococcus pneumoniae cell. 

34. The antibody of any of claims 32 to 33, comprising the features of any of claims 
22 to 26. 

25 35. A recombinant cell transformed or transfected with a polynucleotide comprising a 

sequence encoding a polypeptide, said polypeptide comprising 
- a sequence selected from the group consisting of SEQ ID NO:1-19 or from the 
group consisting of 21-282, preferably selected from the group consisting of 
SEQ ID NO: 1-1 9 or from the group consisting of 21-41, more preferably 
30 selected from the group consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID 

NO:13, SEQ ID NO:28, most preferably SEQ ID NO:16, 
or 

an antigenic fragment or variant of said sequence. 
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36. The recombinant cell of claim 35, wherein the recombinant host cell is an 
Escherichia coli or Salmonella cell. 

37. The recombinant cell of claim 35 or 36, wherein recombinant the cell is an 
5 attenuated or reduced-virulence cell. 

38. Use of a composition comprising 

a polypeptide which comprises a sequence selected from the group consisting 
of SEQ ID NO:1-19 or from the group consisting of SEQ ID NO:21-282, or 
1 0 comprises an antigenic fragment or variant of said sequence, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector, 

15 for the preparation of a medicament for the immunisation of an animal or human 

being against bacterial, preferably Streptococcus, more preferably Streptococcus 
pneumoniae, infections. 

39. The use of claim 38, wherein the immunisation induces a protective immune 
20 response. 

40. The use of claim 38 or 39, wherein the medicament is a medicament suitable for 
parenteral, intravenous, intramuscular, subcutaneous, oral or intranasal 
administration. 

25 

41. The use of any of claims 38 to 40, wherein the composition comprises or further 
comprises 

a polypeptide which comprises SEQ ID NO:16, or comprises an antigenic 
fragment or variant of SEQ ID NO: 16, or 
30 a polynucleotide comprising a sequence encoding said polypeptide, or 

an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 
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42. The use of any of claims 38 to 41, wherein the composition comprises or further 
comprises 

- a polypeptide which comprises SEQ ID NO: 10, or comprises an antigenic 
fragment or variant of SEQ ID NO:10, or 

5 - a polynucleotide comprising a sequence encoding said polypeptide, or 

- an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

10 43. The use of any of claims 38 to 42, wherein the composition comprises or further 

comprises 

- a polypeptide which comprises SEQ ID NO: 13, or comprises an antigenic 
fragment or variant of SEQ ID NO: 13, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
15 an expression vector comprising a sequence encoding said polypeptide, or 

- a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

44. The use of any of claims 38 to 43, wherein the composition comprises or further 
20 comprises 

- a polypeptide which comprises SEQ ID NO:28, or comprises an antigenic 
fragment or variant of SEQ ID NO:28, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
25 a recombinant virus or recombinant cell comprising said polynucleotide or said 

expression vector. 

45. The use of any of claims 38 to 44, wherein the composition further comprises 

- a polypeptide which comprises SEQ ID NO:20, or comprises an antigenic 
30 fragment or variant of SEQ ID NO:20, or 

a polynucleotide comprising a sequence encoding said polypeptide, or 
an expression vector comprising a sequence encoding said polypeptide, or 
a recombinant virus or recombinant cell comprising said polynucleotide or said 
expression vector. 

35 
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46. Use of an antibody capable of binding a polypeptide selected from the group 
consisting of SEQ ID NO:1-282, preferably an antibody as defined in any of claims 
32 to 34, for the manufacture of a medicament for the treatment or prevention of 
Streptococcus, preferably Streptococcus pneumoniae, infections in an animal or 
human being. 

47. The use of claim 46, wherein the polypeptide is selected from the group consisting 
of SEQ ID NO:1-41, preferably selected from the group consisting of SEQ ID 
NO:16, SEQ ID NO:10, SEQ ID NQ:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID 
NO:28 and SEQ ID NO:33, most preferably wherein the polypeptide is SEQ ID 
NO: 16. 

48. A method for raising antibodies to a polypeptide selected from the group 
consisting of SEQ ID NO: 1-282 in a non-human animal comprising the steps of 

a. providing 

- a polypeptide comprising 

a sequence selected from the group consisting of SEQ ID NO:1- 
282, preferably selected from the group consisting of SEQ ID NO:1- 
41, more preferably selected from the group consisting of SEQ ID 
NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID 
NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably SEQ ID 
NO:16 
or 

an antigenic fragment or variant of said sequence, 

- a polynucleotide comprising a sequence encoding said polypeptide, 

- an expression vector comprising a sequence encoding said polypeptide, 
or 

- a recombinant virus or recombinant cell comprising said polynucleotide or 
said expression vector, 

b. introducing a composition comprising said polypeptide, polynucleotide, vector, 
recombinant virus or recombinant cell into said animal, 

c. raising antibodies in said animal, and 

d. isolating and optionally purifying the antibodies. 
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49. A method for generating antibodies capable of binding an intact Streptococcus 
pneumoniae cell comprising performing the steps specified in claim 48 and the 
further step of selecting antibodies capable of binding an intact Streptococcus 
pneumoniae cell. 

50. The method of claim 48 or 49, wherein the animal is a transgenic animal capable 
of producing human antibodies. 

51. A method for detecting Streptococcus pneumoniae or parts thereof in a sample 
comprising the steps of 

a. contacting said sample with an indicator moiety capable of specifically binding 
a polypeptide selected from the group consisting of SEQ ID NO:1-282, 
preferably selected from the group consisting of SEQ ID NO:1-4l, more 
preferably selected from the group consisting of SEQ ID NO: 16, SEQ ID 
NO: 10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and 
SEQ ID NO:33, most preferably the polypeptide of SEQ ID NO:16, 

and 

b. determining whether a signal has been generated by the indicator moiety, 
thereby detecting whether said sample contains Streptococcus pneumoniae or 
parts thereof. 

52. The method of claim 51, wherein the indicator moiety furthermore is capable of 
binding intact Streptococcus pneumoniae cells. 

53. The method of any of claims 51 or 52, wherein said indicator moiety does not 
pass through the membrane of a Streptococcus pneumoniae cell. 

54. The method of any of claims 51 to 53, wherein said indicator moiety consist of or 
comprises an antibody, such as an antibody as defined in any of claims 32 to 34. 

55. A method for detecting Streptococcus pneumoniae or parts thereof in a sample 
comprising the step of analysing said sample by mass spectrometry to evaluate 
the presence and/or quantity of one or more of the polypeptides of SEQ ID NO:1- 
282. 
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56. A method for identifying a binding partner of a polypeptide selected from the 
group consisting of SEQ ID NO: 1-282 or a fragment thereof, comprising the steps 
of 

5 a. providing a polypeptide selected from the group consisting of SEQ ID NO:1- 

282, preferably selected from the group consisting of SEQ ID NO: 1-41 , more 
preferably selected from the group consisting of SEQ ID NO: 16, SEQ ID 
NO:10, SEQ ID NO:13, SEQ ID NO:20, SEQ ID NO:26, SEQ ID NO:28 and 
SEQ ID NO:33, most preferably SEQ ID NO:16 
10 or 

a fragment thereof, 

b. contacting said polypeptide or fragment with a putative binding partner, and 

c. determining whether said putative binding partner is capable of binding to said 
polypeptide or fragment. 

15 

57. A method for identifying a compound with antibacterial activity against 
Streptococcus pneumoniae comprising the steps of 

a. providing a sensitised cell which has a reduced level of a polypeptide selected 
from the group consisting of SEQ ID NO: 1-282, preferably selected from the 

20 group consisting of SEQ ID NO: 1-41, more preferably selected from the group 

consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, 
SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably the 
polypeptide of SEQ ID NO:16, and 

b. determining the sensitivity of said cell to a putative antibacterial compound, for 
25 instance by a growth assay. 



58. A method for identifying an inhibitor of a polypeptide selected from the group 
consisting of SEQ ID NO:1-282, comprising the steps of 

a. providing two cells which differ in the level of a polypeptide selected from the 
30 group consisting of SEQ ID NO: 1-282, preferably selected from the group 

consisting of SEQ ID NO:1-41, more preferably selected from the group 
consisting of SEQ ID NO:16, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:20, 
SEQ ID NO:26, SEQ ID NO:28 and SEQ ID NO:33, most preferably the 
polypeptide of SEQ ID NO:16, 
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b. determining the sensitivity of said cells to a putative inhibitor, for instance by a 
growth assay, and 

c. determining whether said two cells are differently affected by the presence of 
said putative inhibitor. 

5 

59. The method of claim 58, wherein the putative inhibitor does not pass through the 
membrane of a Streptococcus pneumoniae cell. 
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Fig.l 
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Fig. 2 - Sequence listing 

SEQ ID NO:l 
>AnrPlO3029 

MAVFEKVQEIIVEELGKDASEVTLESTFDDLDADSLDLFQVISEIEDAFDIQIEAENDLKTVGDLVAYVE 
EQAK 

SEQ ID NO: 2 
>AnrPl52053 

MVIjPNFKENLEKYAKIjLVANGIIWQPGHTIjAIjS IDVEQRELAHL IVKEAYALGAHEVI VQWTDDVINREK 

FLHAPMERLDWPEYKIAEMJXrZLLENKA 

KVSWTVAAAAGLEWAKKVFPNAASDE 

SALHYTAPGTDLTLGLPKNHWESAGAWAQGEEFLPNMPTEEVFTAPDFRRAD 

EGIKVTFKDGQIVDITAEKGDQVMKDLVFENAGARALGECALVPDPSPISQSGITFFNTLFDENASNHLiA 
IGAAYATSWDGAEMSEEELEAAGLNRSDVHVDFMIGSNQMDIDGIREDGTRVPLFRNGNWAN 

SEQ ID NO: 3 
>AnrPl53 057 

MNKRVQAFLAKMQEKELDGI I INNLKNVYYLTGFWGSNGTVF I SRDRQVLVTDSRYI IAAKQETSGFEIV 
ADRDEL AVI AG I VKDMGLTRI GFED E I S VS YYHRMQAAF AGLDLL P QTQF VEGLRMI KDE AE I AAI RKAC 
SISDQAFRDALDFIKPGKTEIEIANFLDFRMRELGASGLSFDTILASGINSSKPHAHPMHKPVELGEAIT 
MDFGCLYDHYVSDMTRTIYLGHVSDEQAEIYNTVLKANQALIDQAKAGLGFRDFDKIPRDIIIEAGYGDY 
FTHGIGHGIGLDIHEEPYFSQTSTETIKTGMVLTDEPGIYIEGKYGVRIEDDILITETGCELLTLAPKEL 

IVI 

SEQ ID NO: 4 
>AnrP154031 

MIYAGILAGGTGTRMGlSNLPKQFLELGDRPILIHTIEKFVLEPSIEKIWGVHGDWVSHAEDIiVDKYLP 

L YKERI 1 1 TKGGADRNT S I KNI I EAIDAYRPLTPED I WTHDS VRPF I TLRMI QDNI QLAQNHDAVDTVV 
EAVDTIVESTNGQFITDIPNRAHLYQGQTPQTFRCKDFMDLYGSLSDEEKEILTDACKIFVIKGKDVALA 

KGEYSNLKITTVTDLKIAKSMIEKD 

SEQ ID NO: 5 
>AnrPl69731 

MGFTEETVRFKLDDSNKKEISETLTDWASLNDKGYNPINQIVGYVLSGDPAWPRYNNARNQIRKYERD 
EIVEELVRYYLKGQGVDL 

SEQ ID NO: 6 
>AnrP201747 

MQAVEHFIKQFVPEHYDLFLDLSRETKTFSGKVTITGQAQSDRISLiHQKDLEITSVEVAGQARPFTVDHD 

NEALHIELAEAGQVELVLAFSGKITDNMTGIYPSYYTVDGVKKEVIiSTQFESHFAREAFPCVDEPEAKAT 

FDIiSLRFDQAEGELALSNMPEIDVENRKETGIWKFETTPRMSSYLLAFVAGDLQGVTAKTKNGTLVGVYS 

TKAHPLSNLDFSLDIAVRSIEFYEDYYGVKYPIPQSLHIALPDFSAGAMENWGLVTYREVYLW 

ASRQQVALWAHELAHQWFGNLVTM 

DATDGVQSVHVEVKHPDEINTLFDGAIVYAKGSRLMHMLRRWLGDADFAKGLHAYFEKHQYSNTIGSDLW 
DALGQASGRDVAAFMDSWLEQPGYPVLTVKVENDVLKISQKQFFIGENEDKNRLWWPLNSNWKGLPDTL 
ETESIEIPGYAALLAENEGAIiRLNTENTAHYITDYQGDLLEAVLAELETLDNTSKLQIVQERRLLAEAGH 
ISYADLLPVLDKLAKEESYLWSAVSQVISALERFIDEGTDAETAFKGLVAKLARHNYDRLGFEAKDGES 
DEDELVRQLAVSMMIRSNDAEASQVASQIFATHKENLAGLPAAIRSQVLINEMKHHETKDLLALYLDTYT 

HATDAVFKRQLAAALAYSTDADNIQNLITSWKDKFVW 

KAALGGDMSFDSFVILPAHVFKTQQRLAEYKEFFEPQLSDLALSRNIGMGIKEIAARVDLISREKAAVEA 
WLQYGNA 

SEQ ID NO: 7 
>AnrP204323 

MEFEEKTLSRKEIYQGPIFKLVQDQVELPEGKGTAQRDLIFHNGAVCVLAVTDEQKLILVKQYRKAIEAV 
SYEIPAGKLEVGENTAPVAAALRELEEETAYTGKLELIiYDFYSAIGFCNEKDKLYLASDLTKVENPRPQD 

EDETLEVLEVSLEEAKELIQSGHICDAKTIMAVQYWEIiQKK 

SEQ ID NO: 8 
>AnrP313097 

MSNI YDS ANEIi S RGLRGL PE YKAVKAAKD AI AAD AE AS K I F TD YL AF QE E I QKL AQTGQMPDAS F Q AKME 
GFGKQIQGNSLLSEFFTKQQQLAIYIiSDIEKIVFEPVSEIiLK 
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SEQ ID NO: 9 
>AnrP322675 

MI KI LAAC GAGVNS S HQ I KS ALEEEL SNRGYDVHCDAVMVKDVNEDLMKGYD I FT P I AATDLGF E PGI P V 
I E AGP I LFRI P AMS AP VFDNI EAAI KEHGL S 

SEQ ID NO: 10 
>AnrP373 646 

MI FTYNKEHVGDVL MVI VKNS GDAKLDVERKGKVARVFLKDNGETVAWNI F EVS SLF E I AERGQVFLTDE 
QVARLNQELQAEGFTEEIVNDKEPKFWGEIVEMVAHPDSDHLNICQVAVASDKTVQIVAGAPNARVGLK 
TIVALPGAMMPKGNLIFPGELRGEKSFGMMCSPRELHLPNAPQKRGVLELSEDQWGTPFDPAKHWTA 

SEQ ID NO: 11 
>AnrP406411 

MFEIFKSYQFNQEKAHDYGFIENSEVWTYSCQILQGDFVMTVSITADNVSFQVFDQETGDLiYPHVYMESM 
RGSFVGNVREACLEILYQIRKACFDVQDFICHQTKRIMTQVQEKYGNQLEYLWEKSPDTAVLRHEGNQKW 
YAVLMKISWNKLEKGREGQVEAVNLKHDQVANIjLSQKGIYPAFHMSKRYWI SVSLDDTLSDEEVLELIEK 
SWNLTSKK 

SEQ ID NO: 12 
>AnrP428269 

MEQFLDNIKDLEVTTWRAQEAIiDKKETATFFIGRKTCPYCRKFAGTLSGWAETKAHIYFINSEEPSQL 
NDLQAFRSRYGIPTVPGFVHITDGQINVRCDSSMSAQEIKDFAGL 

SEQ ID NO: 13 
>AnrP454144 

MQNMMRQAQKLQKQMEQSQAELAAMQFVGKSAQDLVQATLTGDKKYVSIDFNPAWDPEDLETLSDMTVQ 
AINSALEQIDETTKKKLGAFAGKLPF 

SEQ ID NO: 14 
>AnrP4893 96 

MSQEFINPSDGVIRQYLATSKTIiAWGLSDREETTSNRVTKEMQARGYKIIPVNPKAAGGEILGEKAYAS 
LAEIPFPVDIVNVYRRSEFLPDVARDFLKADAKIFWAQLGLESLEAKEILRDGGCDDIVMNRCIKREHTR 

LIEEA 

SEQ ID NO: 15 
>AnrP494708 

MQVLLFCCNIFYTSfNERVLEIIjRKRRHIMSKKVLFIVGSLRQGSFNHQMAL 

PLFSQDLEVPTHPAVAAAREAVLVADAIWIFSPVYNFSIPGTVKNLLDWLSRALDLSDTRGVSALQDKF^ 
TVSSVANAGHDQLFAIYKDLLPFIRTQGVGDFTAARVNDSAWADGKLVLEETVLNSLEKQAQDLVEAIK 

SEQ ID NO: 16 
>AnrP516029 

MKYI INHSNDTAFNIALEEYAFKHLLDEDQIFLLWINKPS I IVGRHQNTIEEINRDYVRENGIEWRRI S 
GGGAVYHDLNNLNYTIISKEDENKAFDFKSFSTPVINTLAQLGVKAEFTGRNDLEIDGKKFCGNAQAYIN 
GRIMHHGCLLFDVDLSVIiANALKVSKDKFESKGVKSVRARVTNIINELPKKITVEKFRDLLLEYMKKEYP 
EMTEWFSEEELAEINRIKDTKFGTWDWNYGKSPEFWRRGIKFTSGKVEVFANVTESKIQDIKIYGDFF 
GI EDVAAVEDVLRGVKYEREDVLKALKT I D I TRYF AG I S REE IAEAWG 

SEQ ID NO: 17 
>AnrP56981 

MVELNLKNIYKKYPNSEHYSVEDFNLNIKDKEFIVFVGPSGCGKSTTLRMIAGLEDITEGTASIDGWVN 
DVAPKDRDIAMVFQNYALYPHMTVTDNMAFGIiKLRKYSKEDINKRVQEAAEIIiGLKEFLERKPADLSGGQ 
RQRVAMGRAIVRDAKVFLMDEPLSNLDAKLRVSMRAEIAKIHRRIGATTIYVTHDQTEAMTLADRIVIMS 
ATKNPAGTGTIGRVEQIGTPQEVYKNPVN 

REKGYEGKELIFGIRPEDVNAEPAFLETFPDCWKATISVSELLGSESHDYCQVGKDEFVAKVDARDYLQ 
TGATVELGFDLNKAHFFDVETEKTIY 

SEQ ID NO: 18 
>AnrP594255 

MIISEQSDFKRYASWKYFSKVCDFLENTNLTDLVDGKIDIDGEWFANCMTYLADGVPGDIFETHKKYLi 
DIHIWENTEKMAVTSPVRAQSRVPFSEEKDIAFYDSKDYQIVELIiPGNMLVTFEEDLHQPKIHCNDETV 
KKLVIKVLNEEK 
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SEQ ID NO: 19 
>AnirP7 32933 

MFVNFFHEKEKIMRYDFGKVYKEIRESKGLTQEEVCGGVLS 

FEEFEYICQLYQPSQRTEIMQTYLNMRSIIGTSDLVNLFQKCQDYLKTHHDLPIEEIRDMLEWIYLRQH 

GAGELSKHAEQVVKKLWKKIEKQDTWYESDLKI^ 

MTILLNLSTLYLYNQDKNIY^ 

SML S HLKKEVE I YYQAKER 

SEQ ID NO: 20 
>AnrP800607 

MIPWIEQTSRGERSYDIYSRLLKDRIIMLTGPVEDNMAN^ 

AGL AI VDTMNF I KADVQT IVMGMAASMGTVI AS SGAKGKRFMLPNAEYMI HQPMGGTGGGTQQTDMAI AA 
EHLLKTRNTLEKILAENSGQSMEKVHADAERDNWMSAQETLEYGFIDEIMANNSLN 

SEQ ID NO: 21 
>Ari2rP 8 34127 

MRIKWFSLIRIIGLLLVLLYHFFQTIFPGGFFGV 

RI VP PVVLMVLVTMPFTFLVRQDYVAGIGGQ I AGVLGFMTNF YELLTGGS YE S QF I PHLFVHNWSLAVEV 
HYYILWGLAWFLSKQAKSNGQLKGMVFLLSAVAFLISFFSMFIGSFLVTSYSSVYFSSLTHVYPFFLGS 

MLATIVGVRQTTSLVKQLDKIWDLRKTL^ 

ARVLHEKTHHIQESKIISFLADTSYAVYLFHWPFYIIFSQXiTSNLIiAVLLTLIC 
AGKNTPIVQTLRPIjPYIHAILAAGTGILTIIVCTV 

ADANSLGI ADGTML I GD S VALRANTALQTAL PGAQ INAQVSVTTKTANE IMLNNS QNKFL PKTVVT ATGV 
NNPENYKDDWDSIVKNLPKGHHMILVTPYEGDKTKETYAIVEKAAAYMRELAEKTPYITIADW^ 

PE IWAGTDQVHFGS ES ST I E AGAKL YADT I ATALQTAQDKPVKSK 

SEQ ID NO: 22 
>AnrP856854 

MIELKQVSKSFGERELFSNLSMTFEAGKVYALIGSSGSGKTTLMNMIGKIjEPYDGTIFYRG 
DFFRHELGYLFQNFGLIENQSIEENLKLGLIGQKLSRSEQRLRQKQALEQVGLVYLDLDKRIFELSGGES 

qrvalakiilknppfiladeptasidpatsqlimeillslrddnrliiiathnpaiwemadevf™ 

SEQ ID NO: 23 
>AnrP859722 

MLRSQFEEDLEKLHNQFYAMGQEVLSQINRTVRAFVTHDRDLAKEVIEDDAEVNEYEVKLEKKSFEMIAL 
QQPVSQDLRTVLTVLKAVSDVERMGDHAVAIAQATIRMKGEERIPAVEEEIKKMGREVKSWEAALDLYL 
NGSVDDAYRVASMDEQINHYFETIRDLATEEIKKNPEAIVTGRDYFQVISYLERIGDYAKNICEWVVYFE 

TGKIVEL 

SEQ ID NO: 24 
>AnrP871789 

MSKENPLSHHEQLRYDYLLKNIHYLNEREKNEFAYLQEKLTLARGNSSSSLEQEREEQVDLPSYANRSRS 

Q S KS QAL S F P PKKKRRKLRLKRI FMVI F S LLVC VALAMVFMFLRGYQDAS AKKTADARAAQVEVFNGQDT 

RDGWILIMGTDGRIGQNSVETRTDSIMVLNVGGSDKKMKLVSFMRDNLWIDGYSQVINGRKQTDNKLN 

VAYELGEQEGQKGAEMVRQVLKDNFDLDIKYYALVDFQAFATAIDTLFPDGVTIDAQFSTLNGRPLTEAT 

VGDDLYATETESPTQTIKVGKQQMNGSTLLNYARFRDDDEADYGRTKRQQQVLTAILEQIKDPTKL 

EALGKVFAMTSTNVPYTFLLTNGLiSVLDGAKNGIEKLTIPELGDWVDAYDVYGGLGLLVDQNKYQTKL^ 

MGLR 

SEQ ID NO: 25 
>AnrP884020 

MPNYNIPFSPPDITEAEITEVVDTLRSGWITTGPKTK 

VGPGDEVIVPAMTYTASC SVITHVGATPVMVDIQADTFEMDYDLLEQAITEKTKVI I PVELAGIVCDYDR 
LFQWEKKRDFFTASSKWQKAFNRIVIVSDSAHALGSIYKGQPSGSIADFTSFSFHAVKNFTTAEGGSAT 

WKANPVIDDEEMYKEFQILSLHGQTKDAIiAKMQLGSWEYDIVTPAYKCNM 

RRKDIVDRYDSGFAGSRIHPLAHKTETVESSRHLYITRVEGASLEERNLIIQELAKAGIASNVHYKPLPL 
LTAYKNLGFDMTNYPKAYAFFENEITLPLHTKLSDEEVDYIIETFKTVSEKVLTLSKK 

SEQ ID NO: 26 
>AnrP944060 

MS KEVI VE S FELDHT I VKAP YVRL I GE ETGPKGD 1 1 SNYD I RLVQ PNEDS I PTAGLHT I EHLLAKL I RTR 

IDGMIDCSPFGCRTGFHMIMWGRHTSAKIAA 

WAKL I LEQGI S DDAF ERHVI 
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SEQ ID NO: 27 4 *'^ / 
>AnrP948024 

MRAQSFFLTFSFIRSKIKLALNKGVLNMIEITYIDASKNERTVTFESYEDFERSQQACLIGVADYYPVQK 
LTYKGHNLDYHGTYGDIFFYLMKQDLSQYN 

SEQ ID NO: 28 
>AnrP98487 

MTIKLVATDMDGTFLDGNGRFDMDRIiKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDIIFIAENGS 
LVEYQGQDLYEATMSRDFYLATFEKLKTSPYVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVA 
SLEDITDDIFKFTTNFTEETLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVELVKKLGITM 
QVMAFGDNLNDLHMMQWGHPVAPENARPEILELAKTVIGHHKERSVIAYMEGL 

SEQ ID NO: 29 
>AnrPl07243 

MVLPNFKENLEKYAKLLVANGINVQPGHTLALSIDVEQRELAHLIVKEAYALGAHEVIVQWTDDVINREK 

FLHAPMERLDNVPEYKIAEMNYLLENKASRLGVRSSDPGAIjNGVDADKLSASAKAMGLAM 

KVSWTVAAAAGIjEWAKKVFPNAASDEEAVDIjLWDQIFKTCRVYEADPV 

SALHYTAPGTDLTIiGLPKNHVWESAGAWAQGEGFLPNMPTEEVFTAPDFRRADGYVTSTKPLSYNGNII 

egikvifkdgqivditaekgdqvmkdlvfenagaralgecalvpdpspisqsgitffntlfdenasnhla 
igaayatslvdgaemseeeleaaglnrsdvhvdfmigsnqmdidgiredgtrvplfrngnwan 

SEQ ID NO: 30 
>AnrP118660 

MATTESLGRRRGNRRAYLSIDKKELiSRYNLGSCFLIIDKIMEVHMKTISLVYISLSGNTESFVTRLKDYL 
LSQYKRIEVQKIHIKDLVKEGKNFYEMDHPYVAFLPTYLEGGNGVDNGDVEILTTPVGDFIAYGNNASKC 
FGWGSGNRNFNNQYCLTAKQYSQRFGFPVLADFEMRGMLEDIKHVAAIIADLYELEKEN 

SEQ ID NO: 31 
>AnrPl313 54 

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIF 
FHTMCEHHFLPFYGRAHIAYI PDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADAIiMDYLGAKGAFVI I 
EAEHMCMSMRGVRKPGTATLTTVARGLFETDKDLRDQAYRLMGL 

SEQ ID NO: 32 
>AnrPl59129 

MKIKKLLKMVT PVLMISAVGTTFVEANQIGAFSNFVITTSYKRTGYLTKENEGAEYIMNLNPCRNLHPMT 
VKHRIWSNGEARSGESLTTCGTRSTHGNWATVGYWAADMARQNWWDLSAAISGSWSPN 

SEQ ID NO: 33 
>AnrP230653 

MIFDTHTHLNVEEFAGREAEEIAIjAAEMGVTQMN^ 

TEEVEAYLLDKLKHSKWALGEIGLDYHWMTAPKEVQEQVFRRQIQLSKDLDLPFVVHTRDALEDTYEII 
KSEGVGPRGGIMHSFSGTLEWAEKFVDLGMTISFSGWTFKKATDLQEAAKELPLDKMLVETDAPYLAPV 
PKRGRENKTAYTRYVVDFIADLRGMTTEELAVATTANAERIFGLDSK 

SEQ ID NO: 34 
>AnrP310966 

MNTNLASFIVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKIiRLTTLEVTATQDQ 
FGWGRVTEVRKDLGVF VDTGL PDKE IWSLD I L PELKELWPKKGDQLY I RLEVDKKDRI WGLLAYQEDF Q 
RLARPAYNlSnyiQNQNWPAIVYRLKLSGT^ 

LSLKPRSFEMLENDAQMILTYLESNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFG 
TEL I 

SEQ ID NO: 35 
>AnrP332314 

MNSDVLiEFLRTETAEKI SLYI S EANRLEGDVTLLAPNS QDLEDI KNAMIiSNSNLGLKVARLDVMKKI AYA 
S TRNH YLTGAT I FGD I S KGTYNCD PKS YV 
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SEQ ID NO: 36 
>AnrP384168 

MSKLQQIIiTYLESEKLDVAWSDPVTINYLTGFYSDPHERQMFLFVLADQEPLLFVPALEVERASSTVSF 

P WGYVDS ENPWQKI KHAL P QLDF KRVAVEFDNL I LTKYHGLKTVF ETAEFDNLT PRI QRMRL I KS ADEV 
QKMMIAGLYADKAVWGFDNISLDKTETDIIAQIDFAMKREGYEMSFDTMV^ 

NDALLLFDLGVLWGYASDMTRTV^^ 

AGYGEYFNHRLGHGIGMDVHEFPSIMEGNDMVIEEGMCFSAEPGIYIPGKVGVRIEDCGVV 
STSKDLLYFD 

SEQ ID NO: 37 
>AnrP468792 

MKIDITNQVKDEFLISLKTLISYPSVLNEGENGTPFGQAIQDVLEKTLEICRDIGFTTYLDPKGYYGYAE 
IGQGAELLAILCHLDWPSGDEADWQTPPFEATIKDGWVFGRGVQDDKGPSLAALYAVKSLLDQGIQFKK 
RVRFIFGTDEETLWRCMARYNTIEEQASMGFAPDSSFPLTYAEKGLLQVKLHGPGSDQLELEVGGAFNW 
PDKANYQGLLYEQVCNGLKEAGYDYQTTEQTVTVLGVPKHAKDASQGINAVIRLATILAPLQEHPALSFL 
ATQAGQDGTGRQIFGDIADEPSGHLSFNVAGLMINHERSEIRIDIRTPVLADKEELVELLTRCAQNYQLR 
YEEFDYLAPLYVAKDSKLVSTLMQ I YQEKTGDNS PAI S SGGATFARTMPNC VAFGALF PGAKQTEHQANE 
CAVLEDLYRAMD I YAEAVYRLAT 

SEQ ID NO: 38 
>AnrP540869 

MTVTIDWENLGFSYMKLPYRYLAHFKNGQWDQGELTEDATLHISESSPSLHYGQQAFEGLKAYRTKDGSV 
QLFRPDENAKRLQRTCDRLLMPQVPTDMFVEACKAWRANEEYVPIiYGIGGTLYLRPLIiIGVGDIIGVKP 
AEEYIFTIFAMPVGNYFKGGLVPTNFLIQDEYDRAAPNGTGAAKVGGNYAASLLPGKI^KSRHFSDVIYIi 
DPSTHTKIEEVGSANFFGITADNEFVTPLSPSILPSITKYSIiLYLAEHRLGLTPIEGDVPIDNLDRFVEA 

GACGTAAVISPIGGIQHGDDFHVFYSETEVGPVTRKLYNELTGIQFGDIEAPEGWIVKVD 
SEQ ID NO: 39 

>AnrP578829 ^ _._ _ 

MINILAACGAGWSSHQIKSALEEELSNRGYDVHCDAVMVKDWEDLMKGYDIFTPIAATDLGFEPGIPV 

IEAGPILFCIPAMSVPVFDNIRLPAKQNMV 

SEQ ID NO: 40 
>AnrP578945 

MT I VGC RI DGRL I HGQVANLWAGKLNVS RIIWVDDEVVNND I EKS GIjKIjAT P PGVKL S I L PVEKAAANI L 
GGKYDSQRLFIVARKPDRFLGLVEAGVPLETLNVGNMSQTPETRSITRSINWDKDVEDFHKLAEKGVKL 

TAQMVPNDPISDFLSLLK 

SEQ ID NO: 41 
>An:rP982 843 

MKIALINENSQASKNHIIYDSLKEATDKKDYQLFNYGMRGEEGESQLTWQNGLMAAILLNTKAVDFWT 

GC GTGVGAMLALNS F PGWC GL AVD PTDAYLYS Q INGGNAL S I P YAKGFGWGAELTLKLMFERL F AEEMG 
GGYPRERVIPEQRNARILNEVKQITHNDLMTILKTIDQDFLKDTISGKYFQEYFFENCQDDEVAAYLKEV 

LAK 

SEQ ID NO: 42 
>AnrPll0506 

MSSHPIQVFSEIGKLKKAmLHRPGKELENLLPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEGIEVLYL 

EQLAAESLTSPEIRDQFIEEYLDEANIRDRQTKVAIRELLHGIKDNQELVEKTMAGIQKVELPEIPDEAK 

DLTDLVESEYPFAIDPMPNLYFTRDPFATIGNAVSLNHMFADTRNRETLYGKYIFKYHPIYGGKVDLVYN 

REEDTRIEGGDELVLSKDVLAVGISQRTDAASIEKLLVNIFKKNVGFKKVLAFEFA 

VDYDKFTIHPEIEGDLHVYSVTYENEKLKIVEEKGDIjAEL^ 

GSNTLTIAPGWWYDRNTVTNKILEEYGLRIilKIRGSELVRGRGGPRCMSMPFEREEV 

SEQ ID NO: 43 
>ArnrP 1 2 7 4 9 0 

MTKALISIDYTEDFVADSGKLTAGAPAQAISDAISKVTRLAFERGDYIFFTIDAHEENDCFHPESKLFPP 
HNLIGTSGR3STLYGDLGIFYQEHGSDSRVFWMDKRHYSAFSGTDLDIRLRERRVSTVILTGVLTDICVLHT 

AI DAYNLGYD I E I VKPAVAS I WPENHQF ALGHFKNTLGAKIjVDENIiNEL S E 
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SEQ ID NO: 44 
>AnrPl32965 

MTDNFFGKTLAARKVEAI PGMLEFDI PVHGDNRGWFKENFQKEKMLPLGFPESFFAEGKLQN1XIVSFSRKN 
VLRGLHAEPWDKYISVADGGKVLGSWVDLREGETFGNTYQTVIDASKGIFVPRGVANGFQVLSDTVSYSY 

LVNDYWALELKPKYAFVNYADPSLGIEWENIAEAEVSEM 

SEQ ID NO: 45 
>AnrP17099 

MTQGKITASAAMLNVLKTWGVDTIYGIPSGTLSSLMDALAEDKDIRFLQVRHEETGALAAVMQAKFGGSI 
GVAVGSGGPGATHLINGVYDAAMDNTPFLAILGSRPVNEL^^ 

PKVIDEACRAAISKKGPAWEIPWFGFQEIDENSYYGSGSYERSFIAPALNEVEIDKAVEILNNAERPV 
IYAGFGGVKAGEVITELSRKIKAPIITTGKNFEAFEWNYEGLTGSAYRVGWKPANEWFEADTVLFLGSN 
FAFAEVYEAFKNTEKFIQVDIDPYKLGKRHALDASILGDAGQAAK 

RDYMNKDEGKTEGELQLYQVYNAINKHADQDAIYSLDVGSTTQTSTRHLHMTPKNMWRTSPLFATMGIAL 
PGGIAAKKDTPDRQVWNIMGDGAFNMCYPDVITNVQYDLPVINLVFSNAEYGFIKNKYEDTNKHLFGVDF 
TNADYGKIAEAQGAVGFTVDRIEDIDAWAEAVKLNKGGKTWIDARITQHRPLPVEVLELDPKLHSEEA 

IKAFKEKYEAEELVPFRLFLEEEGLQSRAIK 

SEQ ID NO: 46 
>AnrPl73 5 01 

MAKAITDATFEQETKDGLVLVDFWATW^ 

S I PTLLFKKDGQWKQVAGVHTAEQIKAI I aels 

SEQ ID NO: 47 
>ArnrPl743 54 

MTSLKLLKEKAPLVI C I TND WKNFTANGLVALGAS PAMS EF P ADLEDLLKYAGGLL INI GTLTDENWKL 
YQAALKIAEKYWPAVLDPVACGAGEYRKWADDLINNYKLAAIRGNAGEIASLVGIDVASKGVDSAGVD 

NIDE I ALAANEKFNI P I VVTGEVDAI AVNGEVVMI HNGS AMMPKVI GTGCLLGAVVAS F IGLEKGQELKS 
LKTAVLVYNIAGEIAEKRPNGHLPGTFKVEFINALYEITDEDVKEFKRVK 

SEQ ID NO: 48 
>AnrPl89426 

MKNNRILAIiSGNDIFSGGGIiSADLATYTLNGLHGFVAVTCLTALTEKGFEVFPTDDTIFQHELDSLRDVE 
FGGIKIGLLPTVSVAEKALDFIKQRPGVPWLDPVLVCKETHDVAVSELCQELIRFFPYVSVITPNLPEA 
ELLSGQEIKTLEDMKTAAQKLHDLGAPAVIIKGGNRLSQDKAVDVFYDGQTFTILENPVIQGQNAGAGCT 
FASSIASHLVKGDKLLPAVESSKAFVYRAIAQADQYGVRQYEANKNN 

SEQ ID NO: 49 
>AnrP216529 

MEQTFFIIKPDGVKRGLVGEVIiKRIEQRGFTIEKLjEFRSQVSEELIDQHYQDIiVGQSFYPPIREFMTSGP 
VLVGVISGPWIETWRTMMGATRPEEALPGTIRGDFAKAAGENEIIQNVWGSDSEESAKREIALWF 

SEQ ID NO: 50 
>AnrP240537 

MMSQKIIGIDLGGTSIKFAILTTAGEIQGKWSIKTNILDEGSHIVDDMIESIQHRLDLLGLAAADFQGIG 

MG S P G WDRDKGT VI GA YNIiNWKTL Q P I KQ K I E KAL G I P F F I DND ANVAAL GE RWMG AGDNQ PD WF I TL 
GTGVGGGIVAEGKLLHGVAGAAGELGHITVDFDQPISCTCGKKGCLETVASATGIVNLTRRYADEYEGDA 
ALKRLIDNGEEVTAKTVFDLAKEGDDLALIVYRNFSRYLGIACANIGSIIiNPSTIVIGGGVSAAGEFLLQ 

GVQKVYDENS F P QVRT STKLALATLGNDAGVI GAASLVLQ 

SEQ ID NO: 51 
>AnrP272457 

MRIAIGCDHIVTDEKMAVSEFLKSKGYEVIDFGTYDHTRTHYPIFGKKVGEAVTSGQADLGVCICGTGVG 

INNAVNKVPGVRSALVRDMTTALYAK^ 
IEHVESHNAQQTDANFFTEFLEKWDRGEYHD 

SEQ ID NO: 52 
>AnrP278845 

MIYTVTLNPSIDYIVRLDQVKVGSWRMDSDDKFAGGKGINVSRVLKRLNIPNTATGFIGGFTGKFITDT 
LAEEEIETRFVQVAEDTRINVKIKADQETEINGTGPTVEPVQLEELKAILSSLTAEDTWFAGSSAKNLG 
NVIYKDlilSLTRQTGAQWCDFEGQTLIDSLDYQPLIiVKPNNHELGAIFGVKLESLDEIEKYARELLAKG 
AQNVI I SMAGDGALLVTS EGAYF AKP I KGTVKNS VGAGD SMVAGFTGEF VKS KDAVEAFKWGVAC GTATT 
FSDDLATAEFIKETYGKVEVEKR 
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SEQ ID NO: 53 ' 
>AnrP290066 

MSYQDLKECKIITAFITPFHEDGSINFDAIPALIEHLLDHHTDGIL,LAGTTAESPTLiTHDEELELFAAVQ 
KIWGRVPLIAGVGTNDTRDSIEFVKEVAEFGGFAAGLAIVPYYNKPSQEGMYQHFKAIADASDL.PIIIY 

NIPGRVVVELTPETMLRLADHPNIIGVKE 
VASHTNGDEMHEMFIAIAESDVKKAAAIQRKFIPKVN^ 
EEDAKRI I KVWDGDYEATKATVTGVLRPD Y 

SEQ ID NO: 54 
>AnrP294752 

MSHIKFDYSKVLDKFVAPHEVEYMQSQVTAADELIRKGTGAGSDFLGWLDLPEKYDREEFDRILKAAEQI 
KSDSDVLWIGIGGSYLGAKAAIDFLNHHFAN^ 

NVISKSGTTTEPAIAFRVFKELLVKKYGQEEANKRIYATTDRQKGAVKVEADANGWGTFWPDDIGGRFS 
VLTAVGLLPIAASGADIKALMEGANAARKDYTSDKISENEAYQYAAVRNILYRKGYATEILVNYEPSLQY 
FSEWWKQLAGESEGKDQKGIYPTSANFSTDLHSLGQFIQEGTRIMFETVVRVDKPRKNVLIPTIjEEDLDG 
LGYLQGKDVDFVNKKATDGVLLAHTDGDVPNMYVTLPEQDAFTIiGYTIYFFELAIALSGYLNAINPFDQP 

GVE AYKRNMF ALLiGKPGF E ELS KELNARL 

SEQ ID NO: 55 
>AnrP309710 

MALTEQKRVRLEKLSDENGI I S ALAFDQRGALKRLMVKHQTEEPTVAQMEELKVLVADELTKYAS SMLLD 
PEYGLPATKALDEKAGLLLAYEKTGYDTTSTKRLPDCLDVWSAKRIKEEGADAVKFIiLYYDVDSSDELNQ 

EKQAYI ERI GS EC VAED I PFFL EI L AYDEKI ADAGS VE YAKVKPHKVT GAMKVF SD PRFNIDVLKVEVPV 
NIKWEGFAEGEVVYTREEAAAFFKAQDEATNLPYIYLSAGVSAKLFQDTLVFAHESGANFNGVLCGRAT 

WAGS VE AY I KDGE AAAREC VRTTGFENI DELNKVLQRTAT SWKERV 

SEQ ID NO: 56 
>AnrP32 013 

MTFLNKIHETATFLKEKGIAAPEFGLILGSGLGELAEEIENPVWDYAEIPNWGRSTWGHAGKLVYGEL 
AGRKVLALQGRFHFYEGNPLEVVTFPVRVMKVLG 

GENLDDFGPRFPDMSRAYTPEYRATAHEVAKKLNIKLDEGVYIGVTGPTYETPAEIRSYKTLGADAVGMS 
TVPEVIVAAHSGLKVLGlSCITNFAAGFQEELNHEEWEVTERVKGDFKGIiLKAILAEIi 

SEQ ID NO: 57 
>AnrP335459 

MLLIKNGRWDPKSGLDQVCDVLVQDGKIIKIASEITEEGAETIDATGLWAPGLVDIHWFREPGQTHK 
EDIHTGALAAAAGGFTTVVMMANTSPTISDVETLQAVLQSAAKEKINVKTVATIT 

EAGAVGFSDDGIPLESSKIVKEAMEEAKKLNTFlSLHEEDPGLNGVIiGFNENIAREHFHICGATGVAEYA 
MMARDVMIAYATKAHVHIQHLS 

LESDRRAVIEGLKSGVITVIATDHAPHHVDEKWEDITKAPSGM 

EKMTYNPAKLYNFEAGYLAENGPADITIFDAKADRFVDSHFASKAANSPFIGETLKGQVKYTICKGQIVY 
QA 

SEQ ID NO: 58 
>AnrP337646 

MTLWQSTRDANNTVTASQAILQGLATDGGLFTPDTYPKVDDNFDKLKDASYQEVAKLVLSAFLDDFTVE 
ELDYCINNAYDSKFDTPAIAPLVKLDGQYNLELFHGSTI 

ATSGDTGKAAI^GFANVPGTEIIVFYPKDGVSKIQELQMTTQTGDNTHVIAIDGNFDDAQTNVKHMFNDV 
ALREKLTTNKLQFS SANSMNIGRLVPQI VYYVYAYAQLVKTGE I VAGEKVNFTVPTGNFGNILAAFYAKQ 
IGLPVGKLICASNDNNVLTDFFKTRVYDKKREFKVTTSPSMDILVS SNLERXjIFHLLGNNAEKTTELMNA 
LNTQGQYKLTDFDAEILDDFAAEYATEEETAAEIKRVCELDSYIEDPHTAVASAVYKKYQSATGDVTKTV 
I ASTAS PYKF P WAVE AVTGKAGLTDF EALAQLHE I S GVAVP P AVDGLE I AP I RHKTTVAAADMQAAVE A 
YLGL 

SEQ ID NO: 59 
>AnrP3 82550 

MKTIQIAIDGPASSGKSTVAKIIAKDFGFTYLDTGAMYRAAT^ 

RSETGDQLVFVGDVDITHPIRENEVTNHVSAIAAIPQVREKLVSLQQEIAQQGGIVMDGRDIGTVVLPQA 
ELKIFLVASVDERAERRYKENIAKGIETDLETLKKEIAARDYKDSHRETSPLKQAEDAVYLDTTGLNIQE 

WEKIKAEAEKRM 
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SEQ ID NO: 60 * m 
>AnrP388835 

MAKTIHTDKAPKAIGPYVQGKIVG^^ 

HWKTTC FLSDMNDF VPFNEVYQTAFKEEF P ARS AVEVARL PRDVKVE I E VI AE I G 

SEQ ID NO: 61 
>AnrP392175 

MTNQNYLAKTTNKQYIVKFFGKGTEKL 

DSTSIKTKFDKITPILQTIHTSAKELRGEFAPFEEIKKYESLIEEQIPYANYESVRNAVFSLEKRLADLG 

VDRKSCHIDLVPENFIESPQGRLYLIDWEYSSMNDPMWDIiAALFLESEFTSQEEETFL 

EKIAIYKILQDTIWSLWTVYKEEQGEDFGDYGWRYQRAIKGLASYGGSDEK 

SEQ ID NO: 62 
>AnrP398243 

MDLTKRFNKQLDKIQVSLIRQFDQAISEIPGVLRLTLGEPDFTTPDHVKEAGKRAIDQNQSYYTGMSGLL 
TLRQAASDFVKEKYQLDYAPENEILVTIGATEALSATLTAILEEGDKVLLPAPAYPGYEPIVNLVGAEIV 

EIDTTENGFVLTPEMLEKAILEQGDKLKAVI^ 

LTYTGEAHVSLGTMLRDQAI I INGLSKSHAMTGWRLGL I FAPATFTAQLIKSHQYLVTAANTMAQHAAVE 
ALTAGKNDAE PMKKE Y I QRRDYI I EKMTALGF E 1 1 KPDGAF YI F AKI P AGYNQD S FAFLKDFAQKKAVAF 
IPGAAFGRYGEGYVRLSYAASMETIKEAMKRIjEEYMREA 

SEQ ID NO: 63 
>AnrP401255 

MKLIVSVMPRSLEEAQALDATRYLDADIIEWRADYLPKEAILQVAPAIFEKFAGRELVFTLRTRSEGGEI 
DLSPEEYIHLIKEVAQLiYQPDYIDFEYYSYKDVFEEMLDFPNLVLSYHNFQETPENMMEILSELTIIiNPK 
LVKVAVMAHTEQDVLDIiMNYTRGFKTLNPEQEWTISMGKVGKVSRITADVTGSSWSFASLDEVSAPGQI 

S L ASMKKI RE I LDE A 

SEQ ID NO: 64 
>AnrP408652 

MTTLFSKIKEVTELAAVSGHEAPVRAYLREKLTPHVDEWTDGLGGIFGIKHSEAVDAPRVIiVASHMDEV 
GFWSEIKPDGTFRWEIGGWNPMWSSQRFKLLTRDGHEIPVISGSVPPHLTRGKGGPTMPAIADIVFD 
GGFADKAEAESFGIRPGDTIVPDSSAILTANEKNIISKAWDNRYGVLlWSEIiAEALSGQKLGNELYIiGSN 
VQEEVGLRGAHTSTTKFDPEVFLAVDCSPAGDVYGGQGKIGDGTLIRFYDPGHLLLPGMKDFLLTTAEEA 
GIKYQYYCGKGGTDAGAAHLiKNGGVPSTTIGVCARYIHSHQTLYAMDDFLEAQAFLQALVKKLDRSTVDL 

IKHY 

SEQ ID NO: 65 
>AnrP422 671 

MKIDKYSAILGNTVGFHNMSTLTDHRPVASLPFGGKYRLIDFPLSSLANAGVRSVFGIFQQDNISSVFDH 
IRSGREWGLSTLLSHYYLGIYNTRVESSTVGKEYYQQLIiTYLKRSGSNQTVALNCDVLINIDLNQVFHLH 
STTKEPITWYKKLAKKDISEVNAILDVDETDHVLSHKLFDSKSTAETFNMSTDIFWDTPWLIEHLEEE 
AKKEHPEKLRYVLRDLAVKEGAFAYEYTGYLANIHSVKSYYQANIDMLESQKFYSLFSPNQKIYTKVKNE 
EPTYYANTSKVSTSQFASGSIIEGQVANSVIiSRNIHVHKDSLVKDSLLFPRWIGEGAQVEYAILDKGVE 

VE PGW I RGTAEH PVWKKGAKVTED I HS 

SEQ ID NO: 66 
>AnrP441701 

MASKMLHTCLRVENLEKS I AFYQDAFGFKELRRRDF PDHAFTIVYLGLEGDDYELELTYNYDHGPYWGD 
GFAHIAIiSTPDLEALHQEHSAKGYEVTEPNGLPGTTPNYYFVKDPDGYKVEVIREK 

SEQ ID NO: 67 
>AnrP454140 

MS I H I AAQQGE I ADKILL PGD PLRAKF I AENFLDDAVC FNEVRNMFGYTGT YKGHC VS VMGTGMGMP S I S 
I YARELI VDYGVKKL IRVGTAGSLNEEVHVRELVLAQAAATNSNI VRJSTDWPQYDF PQ IAS FDLL.DKAYHI 
AKKLGMTTHVGNVIjSSDVFYSNYFEKNIELGKWGVKAVEMEAAAL 
EDTTAEERQNTFTDMMKVGIiETL I AE 
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SEQ ID NO: 68 lw,/ 

>AnrP454806 

MPNYI KADQFF YPHGVRRGGYIiELVDGKFGKHVEQ I PEGAEVI DYTGYS I APGLVDTH I HGYAGVDVMDN 
NIEGTLHTMSEGLLSTGVTSFLPTTIiTATYEQLLAVTENLGNHYKEATGAKIRGIYYEGPYFTETFKGAQ 
NPTYMRDPGVEEFHSWQKAANGLLNKIALAPERDGVEDFVRTVTGEGVTVALGHSISrATFDEAKKAIDAGA 

SVWVHAYNGMRGLTHRELGIWGAMYQL^ 

EDGDYMLGEFPVVVANGTARIjKSTGNIjAGSILKLKDGLKNVV 
GQ I REGYDADF I VXjDKDLELVATYLDGVKRYQA 

SEQ ID NO: 69 
>AnrP455828 

MTYWAIDIGGTNIKYGLVDQEGQLLESHEMPTEAHKGGPHILQKTKDIVASYLEKGPVAGVAlSSAGiyCV- 
DPDKGEIFYAGPQIPNYAGTQFKKEIEESFTIPCEIENDVNCAGIiAEAVSGSGKGASVTLCLTIGTGIGG 

CLIMDRKVFHGFSNSACEVGYMHMQDGAFQDLASTTALVKWAEAHGEDVDQWNGRRIFKEATEGNKICM 
EGIDRMVDYLGKGLANICWANPEWILGGGIMGQEAILKPKIRTALKEALVPSLAEKTRLEFAHHQNTA 

GMLGAYYHFKTKQS 

SEQ ID NO: 70 
>AnrP463907 

MASKDFHWAETGIHARPATLLVQTASKFASDITLEYKGKSVNLKSIMGVMSLGVGQGADVTISAEGADA 
DDAIAAI S ETMEKEGLA 

SEQ ID NO: 71 
>AnrP490410 

MSIVIGADAAGLRLKEWKDFLEKENFHLVDVTAEGQDFVDVTLAVAAEVNKEEQNLGIVIDAYGAGPFI 
VATKIKGMVAAEVSDERSAYMTRGHNNSRMITMGAQLVGDELAKNIAKGFWGKYDGGRHQIRVDMLNKM 

G 

SEQ ID NO: 72 
>AnrP517215 

METYYKAINWNAIEDVIDKSTWEKLTEQFWLDTRIPLSNDLDDWRKLSNKEKDLVGKVFGGLTLLDTMQS 
ETGVQALRADIRTPHEEAVFNNIQFMESVHAKSYSSIFSTLNTKAEIEEIFEWTNTNPYLQKKAEIVNEI 
YLNGSPLEKKVASVFLETFLFYSGFFTPLYYLGNNKLANVAEIIKLIIRDESVHGTYIGYKFQLGFNELP 

EEEQEKLKEWMYDLLYTLYENEEGYTESLYDGVGWTE 
VMNG I S TGT SNHDF F S Q VGNG YLLGEVE AMQDDD YNYGLD 

SEQ ID NO: 73 
>AnrP525615 

MVKVAVIIiAQGFEEIEALTVVDVLRRANITCDMVGFEEQVTGSHAIQVRADHVFDGDL 
PGSAHLRDNQTLIQELQSFEQEGKKLAAICAAPIALNQAEILKNKRYTCYDGVQEQILDGHYVKETVWD 

GQLTTSRGPSTALAFAYELVEQLGGDAESLRTGMLYRDVFGKNQ 

SEQ ID NO: 74 
>AnrP579600 

ME I SLLTDVGQKRTNNQD YVNHYVNRAGRTM 1 1 LADGMGGHRAGNI AS EMAVTDLGVAWVDTQ I DTVNEV 
REWFAHYLEIENQKIHQLGQDEAYRGMGTTLEVLAIIDNQAIYAHIGDSRIGLIRGEEYHQLTSDHSLVN 
ELLKAGQLTPEEAEAHPQKNIITQSIGQKDEIQPDFGTVILESGDYLLLNSDGLTNMISGSEIRDIVTSD 

I PLADKTETLVRFANNAGGLDNITVALVSMNEEDAE 

SEQ ID NO: 75 
>AnrP582187 

MSQYKIAPSILAADYANFEREIKRLEATGAEYAHIDIMDSHFVPQISFGAGWESIiRPHSKMVFDCHLMV 
SNPEHHLEDFARAGADIISIHVEATPHIHGALQKIRSLGVKPSWINPGTPVEAIKHVLHLVDQVLVMTV 
NPGFGGQAFIiPETMDKVRELVALREEKGLNFEIEVDGGIDDQTIAQAKEAGATVFVAGSYVFKGEVNERV 

QTLRKQLD 

SEQ ID NO: 76 
>AnrP623633 

MANKQDLIAKVAEATELTKKDSAAAVEAVFAAVADYLAAGEKYQLIGFGNFEVRERAERKGRNPQTGKEM 
T I AASKVP AFKAGKALKDAVK 
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SEQ ID NO: 77 "™ 
>AnrP649974 

MEVFESLKANLVGKNARIVLPEGEEPRILQATKRLVKETK^ 
QHYPQFEEIWSALVERRKGKMTEEDVRKVLVEDVN^ 

TRPWTRTSGAFLWRGTERYIiFGDCAININPDAEAIiAEIAINSAITAKMFGIEPKIAMLSYSTKGSGFG 
E S VDKAA7EATKI AHDLRPDLE I DGELQFDAAF VPETAAIjKAPGS TVAGQA1WF I F PGI E AGNI GYKMAER 
LGGFAAVGPVTjQGLNKPVNDIj S RGCNADDVYKLTLi I TAAQAVHQ 

SEQ ID NO: 78 
>AnrP6 53724 

MTDNFFGKILAVRKIDAIPGMLEFDIPWGDNRGWFKENFQKEKMEPLiGFPESFFAEGKLQNNVSFSRKN 
VLRGDHAEPWDKYISVADGGKVLGSWVDLRE 

LVNDYWALELKPKYAFVNYADPSLGIEWENIAEAEVSEADKHHPIiLKDVKPLKKEDL 

SEQ ID NO: 79 
>AnrP682812 

MKQTIILLYGGRSAEREVSVLSAESWRAVNYDRFTVKTFFISQSGDFIKTQEFSQTPGQEDRLMTNATI 
DWDKQVAPSAIYEEGAWFPVLHGPMGEDGSVQGFLEVLKMPWGCNILSSSLAMDKITTKRVLESAGIA 
QVPYVAIVEGDDLTAKIAEVEEKLTYPVFTKPSNMGSSVGISKSENQEELRQALKLAFQYDSRVLVEQGV 
NARE I EVGLLGNYDVKSTL PGE WKDVAF YDYDAKYIDNKI TMD I PAKI S DDWP IMRQNAKTAF RAI GG 
LGLSRCNFFYTDKGEISLNKLiNTMPGFTQWSMYPLLWDNMRI SYPDLIERIjVDIjAKESFDKREAHIjL 

SEQ ID NO: 80 
>AnrP692 615 

MKILVTGFNPFGGEKINPALEAVKLLPSEINGAEVRWVEIPTVFYKSSEVLEAEILRYQPDAVLCIGQAG 
GRTGLTPERVAINQDDARIPDNEGNQPIDTPIRIDGASAYFSSLPIKAMVQAIKKQGLPAWSNSAGTFV 
CNHLMYQALYLVDKKFPNMRAGFMHI PYMMEQWNKPNTAGMSLCD I VRGI EVAI EAI VDYKDKDIiQLVG 
GETH 

SEQ ID NO: 81 
>AnrP7 01774 

MNEVKK3WELKKEAVKDVTSLTKAAPVALAKTKEVLNQAVADLWAW 

EYMEALDGQLDElSERLITLGGSPFSTIiTEFLQNSEIEEEAGEYRNVEESLERVLVIYRYLSELFQKGIiD 
VTDEEGDDVTNGIFAGAKTETDKTIWMLAAELGQAPGL 

SEQ ID NO: 82 
>AnrP707898 

MKAYTYVKPGLASFVDVDKPVIRKPTDAIVRIVKTTICGTDLHIIKGDVPTCQSGTILGHEGIGIVEEVG 
EGVSNFKKGDKVLISCVCACGKCYYCKKGIYAHCEDEGGWIFGHLIDGMQAEYLRVPHADNTLYHTPEDL 

SDEALiVMLSDILPTGYEIGVLKGKVEPGC SVAIIGSGPVGLAAIjLiTAQFYSPAKIjIMVDIjDDNRIjETAIjS 
FGATHKVNSSDPEKAIKEIYDLTDGRGVDVAIEAVGIPATFDFCQKIIGVDGTVANCGVHGKPVEFDLDK 

LWIRNINVTTGLVSTNTTPQLLKAL^^ 
EA 

SEQ ID NO: 83 
>AnrP727368 

M I Q PA S L E EL AS L VEKAGKKVF I F VADWC S D C RY I Y PAL P E I E ETNP E F T F I RMDRD Q YMD L AKLWD VYG 
IPSLWLEKDKEIGRFWRDRKSKQQINDFLAGLK 

SEQ ID NO: 84 
>AnrP727454 

MAQRYQNIWAIDGSKEADLAFVKGVHSALRNDAKLTIAHVIDTRALQSVSTFDAEVYEE 
EYEKRAKDAGVADVHIVIEMGNPKTLLARTIPDAEEVDLILVGATGLNAFERLLVGSSSEYILRHAKVDL 

LWREQEKTL 

SEQ ID NO: 85 
>Ari3rP7 48591 

MKIRGFELVS SFTDENLLPKRETAHAAGYDL KVAVRTWAPGEI VLVPTGVKAYMQPTEVLYLYDRS SNP 
RKKGLVLINSVGVIDGDYYGNPGNEGHIFAQMKNITDQEWLEVGERIVQAVFATFLIADGDAADGVRTG 

GFGSTGH 
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SEQ ID NO: 86 12/47 

>AnrP749497 

MAFIEKGQEIDMEVIKAETQLSAEALRLKESR^ 
ALQKKVADKIFMVMRWTAKPRTNGD^^ 

YPSNLILVDDLVSYHAVGARSVEDQEHRFVASGIDAPVGM 
VETSGNPLAHVILRGAVNEYGNYMPNYYYEIS^ 

QTLQNRDWNEKIKKTVRGFMIESYLADGRQNQPEIFGCSITDPCLGWENTEAIiVEEIYVTLTK 

SEQ ID NO: 87 
>AnrP754359 

MSAIERITKAAHLIDMNDIIREGNPTLRAIAEEVTFPLSDQEIIIjGEKMM 

VGLAAPQLDISKRIIAVLVPNIVEEGETPQEAYDLEAIMYlSrPKIVSHSVQDAAIjGEGEGCL 

WRHARVTVDYFDKDGEKHRIKLKGYNSIWQHEIDHINGIMFYDRINEKDPFAVKDGLLILE 

SEQ ID NO: 88 
>AnrP755180 

MSYQENYQKWVDFVELPDYIiRQDLENMDEKTKEDAFYTNLEFGTAGMRGLVGAGTNRINIYWRQATEGL 

ARLIESKGGNEKERGVAIAYDSRHFSPEFAFESAAVLAKHGIKSYVFESIiRPTPELSFAVRHLNCFAGIM 
VTASHNPAPFNGYKVYGEDGGQMPPHDADALTTYIRAIENPFAVEVADVETEKASGLIEVIGEAVDIEYL 
KEVKDININPALIEEFGKDMKIVYTPLHGTGEMIiARRALAQAGFDSVQWEAQATADPDFSTVTSPNPES 
QAAF ALAE ELGRQVGADVLVATD PDADRVGVEVL QKDGS YLNL S GNQ I GA IMAKYI L E AHKNAGTL PENA 
ALCKS IVSTDIjVTKIAESYGATMFNVLTGFKFIAEKIQEFEEKHNHTYMMGFEESFGYLIKPFVRDKDAI 
QAVL WAELAAYYRS RGLTLADG I EE I YKE YGYYAEKT I S VTLS GVDGAEQ I KAIiyiAKFRNNA PTEWNAT 

AITWEDFKAQTATVADGTVTNLTTPPSDVLKYTLADGSWIAVRPSGTEPKIKFYIAWGETNEESQAKI 
ANIEAEINAFVK 

SEQ ID NO: 89 
>AnrP760417 

MKTKEWDELTVKRAITRITYEIIERNKDLNKIVLAGIKTRGVFIAHRIQERIiKQLENLSVPWELDTKP 
FRDDVKSGEDTSLVSVDVTDREVILVDDVIiYTGRTIRAAIDNIVGHGRPARVSLAVTiVDRGHRELPIRPD 
YVGKNIPTSRSEEI IVEMTELDDQDRVLITEEA 

SEQ ID NO: 90 
>AnrP763 57 

MLDVEAIRKDFPILDQIVNDEPLWLDNAATTQKPLWLKAINSYYEQDNAWHRGVHTLAERATASYEA 
ARETIRKFINAGSTKEVLFTRGTTTSLNWVARFAEEILTEGDQVLISVMEHHSNIIPWQEACRKTGAELV 
YVYLKDGALDMEDLRAKLTDKVKFVSLAHAS1WLGVVNP I KE I TQLAHQVGAIMWDGAQSTPHMKI DVQ 
DLDLDFFAFSGHKMAGPTGIGVLYGKEKYLEQMSPVE 

IGLATAVDYIiEKIGMDAVEAHEQELIAYWPKLQAIEGLTIYGSQDLAQRSGVIAFNIiGDIiHPHDLAT^ 
DYEGVAVRAGHHCAQPLLQYLEVPATARASFYIYNTKADCDKLVDALQKTKEFFNGTF 

SEQ ID NO: 91 
>AnrP769928 

MAIILPELPYAYDALEPYIDAETMHLHHDKHHQTYVNNANAALEKHPEIGEDLEALLADVESIPADIRQA 
LINNGGGHLNHALFWELMTPEKTAPSAELAAAIDATFGSFEEFQAAFTAAATTRFGSGWAWLVVNKEGKL 
EVTSTANQDTPISEGKKPILGLDWEHAYWKYRNVRPDYIKAFFSVINWNKVDELYAAAK 

SEQ ID NO: 92 
>AnrP7750 

MNEVKK1WELKKEAVKDVTSLTKAAPVALAKTKEVLNQAVADLYV 

EYMEALDGQLDEISERLITLGGSPFSTLTEFLQNSEIEEEAGEYRNVEESLERVLVIYRYLSELFQKGLD 
VTDEEGDDVTNGIFVGAKTETDKTIWMLAAELGQAPGL 

SEQ ID NO: 93 
>AnrP796725 

MTDNFFGKTLAARKVEAIPGMLEFDIPVHGDNRGWFKENFQ 

VLRGLHAEPWDKYISVADGGKVQGSWVDLREGETFGNTYQTVIDASKGIFVPRGVANGFQVLSDTVSYSY 
IiVNDYWALELKPKYAFVNYADPSLGIEWENIAEAEVSEADKHHPLLKDVKPLKKEDLE 
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SEQ ID NO: 94 u/ 
>AnrP81562 

MTLAKDIASHLLKIQAVYLKPEEPFTWASGIKSPIYTDNRVTLAYPETRTIilENGFVEAIKEAFPEVEVI 
AGTATAGI PHGAI I ADKMDLPFAY IRSKPKDHGAGNQIEGRVAQGQKMVWEDLI STGGSVLEAVAAAKR 
EGADVLGWAIFSYQLPKADKNFADAGVKLVTL^ 

SEQ ID NO: 95 
>AnrP825823 

MTAIDFTAEVEKRKEDLLADLFSLLEINSERDDSKADAQHPFGPGPVKALEKFLEIADRDGYPTKNVDNY 
AGHFEFGDGEEVLGIFAHMDWPAGSGWDTDPYTPTIKDGRLYARGASDDKGPTTACYYGLKIIKELGLP 
TSKKVRFIVGTDEESGWADMDYYFEHVGLAKPDFGFSPDAEFPIINGEKGNITEYLHFAGENTGVARLHS 
FTGGLRENMVPESATAWSGDLADLQAKLDAFVAEHKLRGELQEEAGKYKVTIIGKSAHGAMPASGVNGA 

TYLALFLSQFGFAGPAKDYLDIAGKILLNDHEGENLKIAHV^ 

RYPKGTSPEQIKSILENLPWSVSLSEHGHTPHYVPMEDPLVQTL.LNIYEKQTGFKGHEQVIGGGTFGRL 
LERGVAYGAMFPDSIDTMHQANEFIALDDLFRAAAIYAEAIYELIK 

SEQ ID NO: 96 
>AnrP867340 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIWRSQEKKDALYEL^ 

QSAVFLIiFVGDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSE 

EVAELFNLPDYTYSVFGMALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTS 

WSQRLAEQFGQAEPSSTRKNLEQKKLL 

SEQ ID NO: 97 
>AnrP889903 

MSKILVFGHQNPDSDAIGSSVAFAYLAKEAYGLDTEAVALGTPNEETAFVLNYFGVEAPRVITSAKAEGA 
EQVILTDHNEFQQSVSDIAEVEVYGWDHHRVANFETASPLYMRLEPVGSASSIVYRMFKEHGVAVPKEI 
AGLMLSGLISDTLLLKSPTTHPTDKIIAPELAELAGVNLEEYGLAMLKAGTNLASKSAEELIDIDAKTFE 
LNGNNVRVAQVNTVD I AEVLERQAE I EAAMQAANESNGYSDFVLMI TDIVNSNSEILALGANMDKVEAAF 
NFKLENNHAFLAGAVSRKKQWPQIiTESFNA 

SEQ ID NO: 98 
>AnrP92 0 891 

MSDCIFCKIIAGEIPASKWEDEQVIjAFLDISQVTLGHTLWPKEHYRNIiLEMDATSASQLFAQVPKVAQ 
KWKVTKAAGMNIISNCEEVAGQTVFHTHVHLVPRYSADDDLKIDFIAHEPDFDKLAQVAETIKNA 

SEQ ID NO: 99 
>AnrP927145 

MILITGANGQLGTELRYLLDERNEEWAVDVAEMDITDAEMVEKYFEEVKPTLVYHCAAYTAVDAAEDEG 
KELDFAINVTGTKNVAKASEKHGATIiVYISTDYVFDGKKPVGQEWEVDDRPDPQTEYGRTKRMGEELVEK 
HVSNFYI I RTAWFGNYGKNFVFTMQl^AKTHKTLTVVNDQYGRPTWTRTLAEFMTYLAENRKEFGYYHL 
SNDATEDTTWYDFAVEILKDTDVEVKPVDSSQFPAKAKRPLNSTMSLAKAKATGFVIPTWQDALQEFYKQ 

EVR 

SEQ ID NO: 100 
>AnrP9312 

MNNLPNC PKCNSEYVYEDGALLVC PECAHEWNPAEVAEVEEGLVAI DANGNKLADGDTVTLIKDLKVKGA 
PKDLKQGTRVKNI RI VEGDHNI DC KI DGF GAMKLKS EFVRKI 

SEQ ID NO: 101 
>AnrP938540 

MEFMLDTLNLDEIKKWSEILPLAGVTSNPTIAKREGSINFFERIKDVREDIGSTPSIHVQVISQDFEGIL 
KDAHKIRRQAGDDI F I KVPVTPAGLRAI KALKKEGYHI TATAI YTVI QGLL AI E AGADYL APYYNRMENL 
NI DSNS VI RQLALAI DRQNS P S KI LAAS FKIWAQVNNALAAGAHAVTAGADVFE S AFAMP S I QKAVDDF S 
DDWFVIQNSRSI 

SEQ ID NO: 102 
>AnrP94874 

MTKLYGSIiEAGGTKFVCAVGDENFNWEKTQFPTTTPIETIDKTIEFFSKFDNLAGLAVGSFGPIDIDKN 

S KT YGF I TTTPKPNWANVDLLGAIiRRALNVPMYFTTDVNS S AYGE MVARNTSTAGGRIENLVYYTIGTGIGA 
GVIQRGEFIGGVGHPEMGHYYVARHPMDIEKEFKGVCPFHKGCLEGYAAGPSLEARTGVRGETIELNNPV 
WDVQAYYIAQAAWATVTFRPDVIVFGGGVMAQQHMLDRVREKFTSLDNGYLPVPDVRDYIVTPAVAGNG 

SATLGNFVLAKEVSK 
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>AnrP961387 

MNTYEGNLVANNIKIGIWARFNEF 

KYDAX I C LGAVI RGNTS HYDYVC S EVS KG I AQ I SLNS E I PVMFGVLTTDT I EQAI ERAGTKAGNKGS EC A 
QGAIEMVNLIRTLDA 

SEQ ID NO: 104 
>AnrP9 64574 

MKLSNRVLEMEESVTLAAGARAKALKAEGRDILSLTLGEPDFTTPKNIQDAAIASIRDGRASFYTVTSGL 
PELKAAVNS YFERF YGYS VASNQVTVAAGAKYSLYTFFMAWNPGDEVI I PTPYWVS YGDQVKMAEGVPV 
FVSAKEDNHFWTVEQLEAARTDKTKVLVLNSPSNPTGMIYTREELLAIGNWAVENDILILADDIYGRLV 
YNGHEFTP I S SLSEAIRKQTWINGVSKTYAMTGWRI GYAVGEADI I AAMSKI AGQTTSNP SAVAQYAAV 
EALSGEQDTVESMRQAFEERLNTIYPLLAEVPGFEWKPQGAFYLFPWKKAMEMKGYTDVTDFTTVILE 
EAEVALVTGAGFGAPENVRLSYATDLDTLKEAVERLKAFMGSEND 



SEQ ID NO: 105 
>AnrP970091 

MbENDIKKVLVSHDEITEAAK^ 

HGGTASSGVINIKQDVTQDIKGRHVLFVEDIIDTGQTLKNLRDMFKEREAASVKIATLLDKPEGRWEIE 
AD YT C F TI PNE F VVGYGLD YKENYRNL P Y I GVLKEE VYSN 

SEQ ID NO: 106 
>AnrP896324 

MKAVVWPESTGVAIEEKVLRPLETGEALVEVEYCGVCHTDLHVAHGDFGQVPGRVLGHEGIGIVKEIAP 
DVKSLKVGDRVSVAWFFEGCGTCEYCTTGRETLCRTVKNAGYSVDGGMAEQCIVTADYAVKVPDGLDPAQ 
ASSITCAGVTTYKAIKEAKVEPGQWVVLYGAGGLGNIiAVQYAKKVFNAHVIAVDINTSTO 
VINGLEVEDVAGL I KEKTDGGAHS AWTAVS KVAFNQAVD S I RAGGRWAVGL P S EMMED S I VKTVLDG I 
QVIGSLVGTRKDLEEAFQFGAEGLWPWQKRPVEDAVAIFDEMEKGQIQGRMVLDFTH 

SEQ ID NO: 107 
>AnrP44215 

MTKTAFLFAGQGAQYLGMGRDFYDQYPIVKETIDRASQVLGYDLRYLIDTEEDKLNQTRYTQPAILATSV 
AIYRLLQEKGYQPDWAGLSLGEYSALVASGALDFEDAVALVAKRGAYMEEAAPADSGKWAVLNTPVEV 
IEEACQKASELGWTPANYNTPAQIVIAGEWAVDRAVELLQEAGAKRLIPLKVSGPFHTADLEPASQKL 
AETLAQVSFSDFTCPIiVGNTEAAVMQKEDIAQLLTRQVKEPVRFYESIGVMQEAGISNFIEIGPGKVLSG 
FVKKIDQTAHLAHVEDQASLVALLEK 

SEQ ID NO: 108 
>AnrP450910 

MKLNRWVTGYGVTSPIGNTPEEFWNSLATGKIGIGGITKFDHSDFDVHNAAEIQDFPFDKYFVKKDTNR 
FDNYSLYALYAAQEAVNHANLDVEALNRDRFGVIVASGIGG 

Nl^SGNVAMRFGANGVCKSINTACSSSNDAIGDAFRSIKFGFQDWLVGGTEASITPFAIAGFQALTALS 
TTEDPTRASIPFDKDRNGFVMGEGSGMLVLESLEHAEKRGATILAEWGYGNTCDAYHMTSPHPEGQGAI 
KAIKLALEEAEISPEQVAYWAHGTSTPANEKGESGAIVAVLGKEVPVSSTKSFTGHLLGAAGAVEAIVT 
I EAMRHNFVPMTAGTS EVSD YI EANWYGQGLEKE I PYAI SNTFGFGGHNAVLAFKRWENR 



SEQ ID NO: 109 
>AnrP10361 

MWKTWEAQDIFDKAWEGFKGVDWKEKASVSRFVQANYTPYDGDESFLAGPTERSLHIKKIVEETKAHY 

EETRFPMDTRPTSIADIPAGFIDKENEVIFGIQNDELFKDNFMPKGGIRMAETTLKENGYEPDPAVHEIF 

TKYVTTVNDGI FRAYTSNI RRARHAHTVTGLPDAYSRGRI I GVYARLAL YGADYLMQEKVNDWNAIKEID 

EET I RLREEVNL QYQAL QQ WRLGDLYGVDVRKPAMNVKEAI QWVNI AFMAVC RVTNGAAT SLGRVP I VL 

DIFAERDLARGTFTESEIQEFVDDFVMKLRTVKFARTKAYDQLYSGDPTFITTSMAGMGNDGRHRVTKMD 

YRFLNTLDNIGNS PEPNLTVLWTDKLPYNFRRYCMHMSHKHS S I QYEGVTTMAKDGYGEMSC I SCCVSPL 

DPENEEQRHNIQYFGARVNVLKALLTGLNGGYDDVHKDYKVFDIEPIRDEVLEFESVKANFEKS 

TYVDALNIIHYMTDRYNYEAVQMAFLPTKQRANMGFGICGFANTVDTIiSAIKYATVKPIRDEDGYIYDYE 

TIGDYPRWGEDDPRSNELAEWLIEAYTTRLRSHKLYKDAEATVSLLTITSNVAYSKQTGNSPVHKGVYLN 

EDGSVNLSKLEFFSPGANPSNKAKGGWLQNLNSLSSLDFSY 

ILDGYFENGGQHVNLNVMDIjNDVYEKIMSGEDVIVRISGYCVNTKYLTPEQKT 

DALS 
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SEQ ID NO: 110 
>AnrP674643 

MPITAADIRREVKEKNVTFIRLMFS 

LYPDLDTWTVFPWGDENGSVAGLICDWTTEGEPFAGDPRGNLKRALRHMEEVGFKSFNIiGPEPEFFLFK 
LDENGDPTLEVNDKGGYFDLAPTDLADNTRREIVNVIjT 

KIQIFKLWKTIARKHGLYATFMAKPKFGIAGSGMHCNMSLFDAEGNNAFFDPNDPKGMQ 

GL IKHAYNYTAIMNPTVNSYKRIjVPGYEAPVYI AWAGRNRS PLVRVPASRGMGTRLELRSVDPMANPYVA 

MAVLL EVGL YG I ENKI E AP AP I EENI YIMTAEERKE AG I TDL P S TLHNAIiKALTEDE WKAALGDH I YT S 

FLEAKRIEWASYATFVSQWEIDNYLDLY 



SEQ ID NO: 111 
>AnrP208610 

MSYKTSNAEGHVDFINTYDLEPMAQQVIPKAAFGYIASGAEDTFTLRENIRAFNHKLIVP 
TEIEFAGEKLSSPIIMAPVAAHKLANEQGEVATARGVHEFGSLYTTSSYSTVDLPEISEALQGTPHWFQF 

YF SKDDGINRHIMDRVKAEGYKAIVLTADATVGGNREVDKRNGFVF PVGMP I VEEYL PEGAGKSMDFVYK 
SAKQRLSPRDVEFIAEYSGLPVYVKGPQCREDVERSLAAGASGIWVTNHGGRQIDGGPAAFDSLQEVAEA 
VDRRVPIVFDSGVRRGQHVFKALASGADLVAIGRPVIYGLALGGSVGVRQVFEHLNAELKTVMQLSGTQT 

IEDVKHFKLRHNPYNPTFPVDPRDLKLY 



SEQ ID NO: 112 
>AnrP234353 

MKEGIPKMGKIEVINHPLIQHKLSILRRTDTSTKAFRELVDEIAMLMGYEVLRDLPLEDV 
QKQLAGKKLAIVPILRAGIGMVDGLLSLVPAAIWGHI^ 

LATGGSAILAVDSLKKRGASNIKFVCLVSAPEGVKALQEAHPDVEIFTAALDERLNEHGYIVPGLGDAGD 
RLFGTK 



SEQ ID NO: 113 
>AnrP665711 

MSKFNRIHLWLDSVGIGAAPDANNF VNAGVPDGASDTLGHI SKTVGDNVPNMAKIGLGNI PRETPLKTV 
AAESNPTGYATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYS 
GTAVIYDFGPRQMETGELIIYTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPAIiLGRIIARPYVG 
E PGNFTRTANRRDLAVS PFFPTVLDKLNEAGIDTYAVGKINDI FNGAGINHDMGHNKSNSHGIDTLLKTM 
GLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDERLPEIIAAMRENDDLLITADHGNDPTYAGTD 
HTREYI PLLAYS PAFKGNGL I PVGHFAD I S ATVADNFGVET AMI GE S FLDKLV 



SEQ ID NO: 114 
>AnrP881257 

MNKRVKIVATLGPAVEIRGGKKFGEDGYWGEKLDVEASAKNIAKLIEAGANTFRFNFSHGDHQEQGERMA 

TVKLAEKIAGKKVGFLLDTKGPEIRTELFEGEAKEYSYKTGEKIRVATKQGIKSTREVIALNVAGAIiDIY 

DDVEVGRQVLVDDGKLGIiRWAKDDATREFEVEVENDGI I AKQKGVNI PNTKI PFPALAERDNDDIRFGL 

EQGINFIAISFVRTAKDVNEVRAICEETGNGHVQLFAKIENQQGIDNLDEIIEAADGIMIARGDMGIEVP 

FEWPWQKMIIKKVNAAGKWITATNMLETMTEKPRATRSEVSDVFNAVIDGTDATMLS 

E S VTTMAT I DKNAQALLNE YGRLD SD S F ERNSKTEVMAS AVKDAT S SMDIKLWTLTKTGHTARLI SKYR 

PNADILALTFDELTERGLMIiNWGVIPMLTDAPSSTDDMFEIAERKAVEAGLVESGDDIVIVAGVPVGEAV 

RTNTMRI RTVR 



SEQ ID NO: 115 
>AnrP171086 

MTYPNLLDRFLTYVKVNTRSDEHSTTTPS 

SLTRKIGFISHMDTADFNAEGVNPQVIENYDGCVIELGNSGFKLDPADFKSLEKYPGQTLITTDGTTIiLG 

ADDKSGIAEIMTAIEYLTAHPEIKHCEIRVGFGPDEEIGVGANKFDAEDFDVDFAYTVDGGPLGELQYET 

FSAAGAELHFQGRNVHPGTAKGQMVNALQLAIDFHNQLPENDRPELTEGYQGFYHLMDVTGSVEEARASY 

IIRDFEKDAFEARKASMQSIADKMNEELGSNRVTLNLTDQYYNMKEVIEKDMTPITIAKA 

IEPIRGGTDGSKISFMGIPTPNIFAGGENMHGRFEYVSLQTMERAVDTIIGIVRSL 



SEQ ID NO: 116 
>AnrP120435 

MSDRKI^KLFALNSNQEIAQKIAQAVGVPLGKL^^ 

MELLIMVDACVRASAHSINVVLPYFGYARQDRIACPREPLTAKLVANMLV 

FDI PVDNLFTVPLFAKHYCDKGIiLGSDWWS PKNS GVKRARSLAE YLDAP I AI IDYPQDDATRNEGYI I 
GDVEGKKAI L I DD I LNTGRTF S EAS KI VEREGATE I YAVS SHGLFVEGAAELLDNTNI KE I LVTD S VATK 
EKTPKNVCYITASELIGDAIVRIHERKPVSPLFAYNKKK 
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>AnrP599544 

MTEMLKGIAASDGVAVAKAYLLVQPDriSFETITVEDTNAEEARLDAAIiQASQDELSVIREKAVGTLGEEA 
AQVFDAHLMVLAD PEMI S Q I KET I RAKKVNAE AGLKEVTDMF I T I FEGMEDNP YMQERAAD I RDVTKRVL 
ANLLGKKLPNPASINEEVIVIAHDLTPSDTAQLDKNFVK^ 

TEIVKDGDIIiAVNGITGEVIINPTDEQAAEFKAAGEAYAKQKAEWALLKDAQTVTADGKHFELAANIGTP 
KDVEGVNNNGAEAVGL YRTEFLYMD S QDF PTEDEQYEAY KAVLEGMNGKPVWRTMD I GGDKEL PYFDMP 
HEMNPFLGFRALRISISETGDAMFRTQIRALLRASVHGQLRIM 

GVAVADNI Q VGIMI E I PAAAML ADQF AKEVDF F S I GTNDL I Q YTMAAJDRMNEQVS YLYQPYNPSILRLIN 

NVIKAAHAEGKWAGMCGEMAGDQQAVPLLVGMGL^ 

STMEEVLELQKEYVNFD 

SEQ ID NO: 118 
>AnrP671474 

MGLKHLEDVTYFRLNNE INRPVNGQ IMLHKDKEALDAFFKENWPNTMVFD S I KDKINYL I EHNY I ETAF 
IKKYRPEFLEELAQFIKDQNFQFKSFMAAYKFYNQYALKT^ 

DIANEIIHQRYQPATPSFLNAGRARRGELVSCFLIQVTDDMNSIGRSINSALQIiSRIGGGVGITLSNLRE 
AGAP I KG YEGAASG WP VMKL F E D S F S YSNQLGQRQGAGVVYLNVFHPD 1 I AFL S TKKENADEKVRVKTL 
S LGVWPDKF YEL ARKNE EMYLF S P YS VEKE YGVPFNYID I TEKYDELVANPNI RKTKI KARDL ETE I S K 
LQQESGYPYVVNIDTANRANPVI)GKIIMSNLCSEILQVQEPSL^ 

TSPDFGRSIRAMVRALTFVTDSSHIVAVPTIDHGNSQAHTFGLGAMGIiHSYIiAQQIilEYGSPESVEFTSI 

YFMLMNYWTLVESNNIARERGITFHNFEKSDYANGSYFDKWTGEFVPTSDRVKELFKW 

LRDKVQEDGLYHQNRIiAVAPNGSISYINDVSASIHPITQRIEERQEKKIGKIYYPAAGLSTETIPYYTSA 

YDMDMRKVIDVYAAATEHVDQGLSLTLFMRSDIPKGIjYEWKRENKQTTRDLSILIWYAFNKGIKSIYYVR 

TFTDDGGEVGANQCESCVI 

SEQ ID NO: 119 
>AnrP144661 

MTSAKEYIQSVFETVKARNGHEAEFLQAVEEFFNTLEPVFEKHPEYIEENILiARITEPERWSFRVPWVD 
RDGKIQVNRGYRVQFNSAVGPYKGGLRFHPTVNQGILKFLGFEQIFKNVLTGLPIGGGKGGSDFDPKGKT 
DAEVMRFCQSFMTELQKHIGPSLDVPAGDIGVGGREIGYLYGQYKRLNQFDAGVLTGKPLGFGGSLIRPE 
ATGYGLVYYTEEMLKANGNSFAGKKWISGSGNVAQYALQKATEIiGATVISVSDSNGYVIDENGIDFDIiL 
VDVKEKRRARLTEYAAEKATATYHEGTVWTYAGNYDIAL) PCATQNEINGEAAKRLVAQGVIC VS EGANMP 
SDLDAIKVYKENGIFYGPAKAANAGGVAVSALEMSQNSLRLSWTREEVDGRLKDIMTNIFNTAKTTSETY 
GLDKDYLAGANIAAFENVANAMIAQGIV 

SEQ ID NO: 12 0 
>AnrP381397 

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELIiEAGFKE 
AAYDLYKIAIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFDPSNPVEKVE 
VIjNWLFWQAGAAPFLGGGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELiAQKPYIAGNDYTIADIAI 
WSWYGQLVQGNL YQGS AKFLDAS S YQNIiVKWAEKI ANRPAVKRGLEVTYTE I K 

SEQ ID NO: 121 
>AnrP649096 

MSRKPFIAGNWKMNKNPEEAKAFVEAVASKLP 

NAGAFTGETSPQVLKEIGTDYWIGHSERRDYFHETDEDINKKAKAIFANGMLPIICCGESLETYEAGKA 
AEFVGAQVSAALAGLTAEQVAASVIAYEPIWAIGTGKSASQDDAQKMCKWRDWAADFGQEVADKVRVQ 
YGGSVKPENVASYMACPDVDGALVGGASLEAESFLALLDFVK 

SEQ ID NO: 122 
>AnrPl74421 

MLTYDLIVIGFGKAGKTLAGKLASAGKKVALVERSKAMYGGTCINIGCIPTKTIiLVAAEKDLSFEEVIAT 
KNTI TGRLNGKNYTTVAGTGVD I FDAE AHFL SNKVI E I Q AGDEKQELTAET I VINTGAVSNVL P I PGL AT 
SKNVFDSTGIQSLDKLPEKLGVLGGGNIGLEFAGLYNKLGSKVTVLDTLDTFLPRAEPSIAALAKQYLEE 
DGIELLQNIHTTEIKHDGDQVLVVTEDETYRFDALLYATGRKPNVEPLQLENTDIELTERGAIKVDKHCQ 
TWPGVFAVGDWGGLQFTYISLDDFRVWSYLAGDGSYTLEDRIiNVPNTMFITPALSQVGLiTESQAADL 
KL PYAVKE I PVAAMPRGHVNGDLRGAFKAWNTETKE ILGAS I FSEGSQEI INI ITVAMDNKI PYTYFTK 
QIFTHPTLAENLNDLFAI 
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SEQ ID NO: 123 ■//-»/ 

>AnirP3 27251 

MLTYDLIVIGFGKAGKTLAGKIjASAGKK^ALVERSKAMYGGTC INIGC I PTKTIjLVAAEKDIjSFEEVTAT 

KNTITGRLNGKNYATVAGTGVDIFDAEAHFLSNKVIEIQAGDEKKELTAETIVINTGAVSNVLPIPG 

SKNIFDSTGIQSLDKLPEKLGILGGGNIGLEFAGLYNKLGSKVTVIiDAIiDTFLPRA.EPSIAALAKQyMEE 

DGIELLQNIHTTEIKNDGDQVLWTEDETYRFDALLYATGRKPWEPLQLENTDIEIiTERGAIKVDKHCQ 

TWPGVFAVGDWGGLQFTYlSIiDDFRWYSYLAGDGSYTLEDRIiWPNTMFITPALSQVGIiTESQAADL 

KLPYAVKEIPVAAMPRGHVNGDLRGAFKAVVNTETKEILGASIFSEGSQEIINIITVAMDNKIPYTYFTK 

QI FTH PTL AENLNDLF AI 

SEQ ID NO: 124 
>AnrP2332 6 

MILITGANGQLGTELRYLLDERNEEYVAVDVAEMDITDAEMVEKVFEEVKPTLVYHCAAYTAVDAAEDEG 
KELDFAINVTGTKNVARASEKHGATLWISTDWFDGKKPVGQEWEVDDRPDPQTEYGRTKRMGEELVEK 
HVSNFYI I RTAWVFGNYGKNFVFTMQNLAKTHKTLT WNDQ YGRPTWTRTLAEFMT YLAENRKEFGY YHL 
SNDATEDTTWYDF AVE I LKDTDVEVKPVD SSQF PAKAKRPLNSTMSLAKAKATGFVI PTWQDALQEF YKQ 
EVR 

SEQ ID NO: 125 
> An it P 3 922 69 

MKKIVLVSLAFLFVLVGCGQKKETGPATKTEKDTLQSALPVIENAEKNTWTKTLVLPKSDDGSQQTQTI 
TYKDKTFLSLAIQQKRPVSDELKTYIDQHGVEETQKALLEAEEKDKSIIEARKLAGFKLETKLLSATELQ 
TTTSFDFQVLDVKKASQLEHLKNIGLENLiliKNEPSKYlSDRLANGATEQ 

SEQ ID NO: 12 6 
>Anr P7 58033 

MPTLEI AQKKLEF I KKAEE YYNALCTNI 

DGDTRNSVMSGFFKSREKITGLTEFLSGTADLSHGLCDTNIENLFWQSGTVSPNPTALLQSKNFNDMIE 
TLRKYFDYI I VDTAP IGI VIDAAI ITQKCDAS ILVTATGEVNKRDVQKAKQQLEQTGKLiFLGVVFNKLD I 

SVDKYGVYGFYGNYGKK 

SEQ ID NO: 127 
>AnrP140539 

MAKSNFEKVESVVGWVRDKKITGYRISKETN^ 
D 

SEQ ID NO: 12 8 
>AnrP527554 

MAKGFAKGLVTGVAGTVAAVAGAVYAFKKKVI E PEEQKAAF I EENRKKAARRRVS R 

SEQ ID NO: 129 
>AnorPl99471 

MLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTIiRALEEIESGNVTIHPDPEGKREA 
VRRRIEEEKRRKEEEEKKIKEQIAKEKEDGEKI 

SEQ ID NO: 13 0 
>AnrP533516 

MSVEEKLNQAKGSIKEGVGKAIGDEKMEKEGAAEKWSKVKEVAEDAKDAVEGAVEGVKNMLSGDDK 

SEQ ID NO: 131 
>AnrP520183 

MSQSSYLSPLLiWLKKEADKEKMSATQCQIFFFYYQMFELLFARESDMKDLCLGTKGFYFSQLEKNLLSGV 
SRFLKNLEGKVTLKANQEVSARKALFLAI^ 

QSALEKDYIVKVIGDKHFVLKRQDATKLTARQTQTLEILSQSEDLVNPVYVTLGEKGVLLLD 

SEQ ID NO: 132 
>AnrP209695 

MDSFDKGWFVLQTYSGYENKVKENLLQRAQTYNMLDNILRVEIPTQTVQVEKNGKRKEVEENRFPGY^ 
EMVMTDEAWFWRNTPIWTGFVG 

ADYTGKITEIDNNKVKMI I SMFGNDTVAEVNLNQIAEL 
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>AnrP3 41409 

MTATKMNAQE 1 1 QF I ANAEKKT S VKVTFEGQLATAVP S S WKLGNVLFGDWKDVAPLiLEGIiVENQD YWE 
QDARNSAVPLLDKRAINARIEPGAIIRDQVEIGDNAVIMMGSVINIGAEIGAGTMIDMGAILGGRAIVGK 

NS HVGAGAVLAGVT E P AS AE P VRVGD3WL I GANAW I EGVQ I GS GS WAAGAI VT QDVPENVWAGVP AR 
I IKE IDAQTQQKTALEDAIiRTL 

SEQ ID NO: 134 
>AnrP456483 

MD YNNDKE E I SMIiKEVLTVAKVAKKS SLFLGGVAFGTLGLKILASKEAKKGYSKALAKAYKLKDELDASV 
SWKQHGDDVLQDAKYLYEQEKKEEQLDSLIGE 

SEQ ID NO: 135 
>AnrP349465 

MKKFFGEKQHRFSLRKLAIGLVSASISSDFFVSIASSGIVFAQENAAVHYKYVTDTELSSQEKDLIVKGI 
PKITEDSESTYYLVYRMDEKAQLGQLPNTGGQNSLTSV^^ 

TGMGSGLASSVHAIENQLIiLQYNQEYQLSQGDSLPLPRALSGYTYLGYIKQDKEINQQETAARDQKFDYT 
VQPHFQTNEGRQRAGDEQKAPSPTLPADKPIPSQDSSNQNPSGLASVDPQDEVLAGRVNKPELIiYKDQEI 
VTKLDVPELVQENPELTEGTIWKQEGRAGKKVEWRIFTVENQEISREVLSTKLEEALPRIVEKGTKKA 
WPSEAPQSAKKGEPETQAPLPEYNGNQAGTIVAPEIAEKPEYTGTQAGAWEPEQVAPLPEYQGTQAGA 
IVEPEQVEPEVGGVQSGALVEPETADKPTYTGEQSGAIVEPEQVPPTPEYKGTQAGAIVGPETTEKPEYT 
DTQSGAIVEPETQSSLPEYTGEQSGAIVAPETTEIPEYTGTQAGAWEPEQVAPLPEYTGNTEQVKPEAP 
TEKPKEKDPEKTIiELRNVSDLELYSQTNGTYKQHVSLDGVPSNPDTYFVKVTSSSFKDVYLPVASITAET 
KDGQPVYKITAKAEKLQQELENKYVDNFTFYLAKKATEETTTFTSFSNLVKAINQNLSGTYHLAASLNAN 
EVELEPEAKSYIKGTFTGQLIGEKDGKQYAIYNLKKPLFETLSGATVEKIiSLKNVSISGKDDIGSLAYEA 
QNGTKIKQVHVDGVLAGERGIGGIiLAKADQSSITESSFKGRIINTYETTAAYNIGGLVGHLTGSRALLTK 
S KATVAI S SNTNS SDQTVGGLAGLVDQDAQ I QD S YAEGD INNAKHF GRVAGVAGYLWDRT SNLEKHAGS L 
TNVL S D INVTNGNAI TGYH YNDMKVKDTF S SKANRVYNVTLVKDEVVSKE S FEERGTMLD AS Q I E S KKAA 
INPLTLPIVEPLSTSGKKDSDFSKVDHYQAKRDLAY 

SVVmKDNQVITDIVSNKETANKIiLLHYKDHSYEKLNLKYQADFANIiAEYSLGNTGLL 
IIKQVLPDLQKVDYRSEAIRKTLGISPKVEQTELYLEDQFAKTKEHLEDSLKKLLSADAGLAGDNPVTKG 

YLVDKIKRNKEALLLGLTYLERWYNFSYGQVW 

NVDTYSISLASHHGTTDLFSTLEHYRKVFLPNTSNNDWFKSETKAYIVEEKSNIAEVKAKQEQAETKYSI 
GVYDRITSATWKYKNMVLPLLT^ 

QRDHYDYWYRILDEQSREKLYRTILLYDAYKFGDDTTSGK 

GAYATGDGVYYMSYRMLDKDGAITYTHEMTHDSDQDIYLGGYGRRSGLGPEFFAKGLLQAPDQPSDATIT 
INSILKHSKSDSTEGSRLQVLDPTERFQNAADLQNYVHNMFDLIYML 

IENKYVKDPADGNEVYATNVVKEIjTEEEARNLNSFDSLIDHNIIjSAREYQTGDYERNGYYTIKIjFAPIFS 
ALSSEKGTPGDLMGRRIAYELLAAKGFKDGMVPYISNQYEEIAKQKGKTINLYGKERGLVTDDLVLEKVF 

EGKYASWADFKKAMYKERVDQFKNLKQVTFKDPTKPWPNYTTET 
YNPEYDSAVHKLKRAIFKAYLDQTKDFRTSIFKK 

SEQ ID NO: 13 6 
>AnrP648160 

MSFYNHKEIEPKWQGYWAEHHTFKTGTDTSKPKFYALDMFPYPSGAGLHVGHPEGYTATDILSRYKRAQG 
YNVLHPMGWDAFGL PAEQYAMDTGNDPAEFTAENI ANFKRQ INALGFS YDWDREVNTTDPNYYKWTQWI F 
TKLYEKGLAYEAEVPVNWVEELGTAIANEEVLPDG 

DWSESIKDMQRNWIGKSTGANVTFKVKGTDKEFTVFTTRPDTLFGATFTVLAPEHEIiVDAITSSEQAEAV 
ADYKHQASLKSDLARTDLAKEKTGWTGAYAINPWGKEMPIWIADYVLASYGTGAVmVPAHDQRDWEF 
AKQFDLPIVEVLEGGWEEAAYTEDGLHVNSDFLDGLNKEDAIAKIVAWLEEKGCGQEKVTYRLRDWIiFS 

RQRYWGE P I P 1 1 HWEDGT STAVP ETEL PLVL PVTKD I RP S GTGE S PLANLTDWL EVTREDGVKGRRETNT 
MPQWAGSSWYYLRYIDPHNTEKLADEDLLKQWLPVDIWGGAEHAVLHLLYARFWHKFLYDLGWPTKEP 

FQKLFNQGMILGTSYRDHRGALVATDKV^ 

DTLRWEMFMGPLDASIAWSEEGLEGSRKFLDRVYRLITSKEILAENNGALDKVYNETVKAVTEQIESLK 

FNTAI AQIilWFVNAANKEDKIjYVDYAKGF IQLIAPFAPHLAEELWQTVAETGES I SYVAWPTWDESKLVE 
DEIEIWQIKGKVRAKLWAKDLSREELQEIALADEKVKAEIDGKEIVKVIAVPNKIiVNIWK 

SEQ ID NO: 137 
>AnrP782362 

MTKANFGWGMAWGRNLALNIES 

MLMVQAGPGTDATIQAIiLPHLDKGDILIDGGNTFYKDTIRRNEELANSGINFIGTGVSGGEKGALEGPSI 

MPGGQKEAYELVADVLEE I S AKAP EDGKPC VTYI GPDGAGHYVKMVHNGI E YGDMQL I AE S YDLMQHLLG 
LSAEDMAEIFTEWNKGELDSYLIEITADILSRKDDEGQDGPIVDYILDAAGNKGTGKWTSQSSLDLGVPL 
SLITESVFARYISTYKEERVHASKVLPKPAAFNFEGDKAELIEKIRQALYFSKIISYAQGFAQLRVASKE 
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ISHSnA/NLiPFADI AS IWRDGC I IRSRFIjQKITDAYNRDADLT^SrLIjLDEYFIjDVTAKYQQAVRDI VALAVQAGV 
PVPTFSAAITYFDSYRSADLPANLIQAQRDYFGAHTYQRKDKEGTFHYSWYDEK 

SEQ ID NO: 138 
>AnrPl08886 

MREYDIIAIGGGSGGIATMNRAGEHGAQAAVIEEKKLGGTCVWGCVPKKIMWYGAQIAETFHQFGEDYG 
FKTTDLNFDFATLRRNRESYIDRARSSYDGSFKRMGVDLIEGHAEFVDSHTVSVNGEIilRAKHIVIATGA 
HPSIPNIPGAELGGSSDDVFAWEELPESVAILGAGYIAVELAGVLHTFGVKTDLFVRRDRPLRGFDSYIV 
EGLVKEMERTNLPLHTHKVPVKLEKTTDGITIHFEDGTSHTASQVIWATGRRPNVKGLQLEKAGVTLNER 
GF I QVDE YQNTWEG I YALGDVTGEKELTP VAI KAGRTL S ERLFNGKTTAKMD YS T I PTWF SHP AI GTV 
GLTEEQAIKEYGQDQIKVYKSSFASMYSACTRNRQESRFKLITAGSEEKWGLHGIGYGVDEMIQGFAVA 

I KMGATKADFDATVAI H PT S S EEF VTMR 

SEQ ID NO: 139 
>AnrP860746 

MS SGKIAQVIGPWDVLiFAAGEKLPE INNALWYKNDERKTKI VLEVALELGDGMVRTIAMESTDGLTRG 
MEVLDTGRPISVPVGKETLGRVFNVLGDTIDLEAPFTEDAERQPIHKKAPTFDELiSTSSEILETGIKVID 
LLiAP YliKGGKVGLFGGAGVGKTVL IQELI HNI AQEHGG I S VF AGVGERTREGNDLYWEMKE SGVI EKTAM 
VFGQMNEPPGARMRVALTGLTIAEYFRDVEGQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLiAT 
EMGQLQERITSTKKGSVTSIQAIYVPADDYTDPAPATAFAHLDSTTNLERKLVQLiGIYPAVDPLASSSRA 
LAPEIVGEEHYAVAAEVKRVLQRYHELQDIIAILGMDELSDEEKTLVARARRIQFFLSQNFNVAEQFTGQ 
PGSYVPVAETVRGFKEILDGKYDHLPEDAFRGVGSIEDVIAKAEKMGF 

SEQ ID NO: 140 
>AnrP5 66006 

MAINAQEISALIKQQIENFKPNFDVTETGWTYIGDGIARAHGLENVMSGELLNFENGSYGMAQNLESTD 
VGIIILGDFTDIREGDTIRRTGKIMEVPVGESLIGRWDPLGRPVDGIiGEIHTDKTRPVEAPAPGWQRK 
SVSEPLQTGLKAIDALVPIGRGQRELiIIGDRQTGKTTIAIDTILNQKDQDMICIYVAIGQKESTVRTQVE 
TL RQ YGALD YT IWTASAS Q P S PLLFL AP YAGVAMAEEFMYQGKHVL I VYDDL S KQAVAYREL S LLLRRP 
PGREAFPGDVFYLHSRLLERSAKVSDELGGGSITALPFIETQAGDISAYIATNVISITDGQIFLGDGLFN 
AGIRPAIDAGSSVSRVGGSAQIKAMKKVAGTLRIDLASYRELEAFTKFGSDLDAATQAKLNRGRRTVEVL 
KQPVHKPLPVEKQVTILYALTHGFLDTVPVDDIVRFEEEFHAFFDAQHPEILETIRDTKDLPEEAVLDAA 
ITEFLNQSSFQ 

SEQ ID NO: 141 
>AnrP50583 

MQEKIIiVTGGAGFIGTHTVIELIQAGHQWWDNLVNSNRKSLEWEGITGVEIPFYEADIRDTDTLRDI 
FKQEEPTGVIHFAGLKAVGESTRIPLAYYDNNIAGTVSLLKAMEENNCKNIIFSSSATVYGDPHTV 
DFPLSVTNPYGRTKLMLEEIIjTDIYKADSEWNWDLRYFNPIGAHESGDLGENPNGIPNNLL 
GKLEQVQVFGDDYDTEDGTGVRDYIHWDLAKGWAALKKIQK^ 

VGRPIPYRIVERRPGDIAACYSDPAKAKAELGWEAELDITQMCEDAWRWQSKHPNGFED 

SEQ ID NO: 142 
>AnrP309124 

MI E YKNVALRYTEKDVLRDVNLQ I EDGEFMVLVGP SGS GKTTMLKMINRLLEPTDGNIYMDGKRIKDYDE 
RELRLSTGYVLQAIALFPNLTVAENIALIPEMKGWSKEEITKKTEELDAKVGIiPVAEYGHRLPSEIiSGGE 
QQRVGIVRAMIGQPKIFIiMDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQ 
DGEIRQVANPETIL,KAPATDFVADLFGGSVHD 

SEQ ID NO: 143 
>AnrP476264 

MSEKLVEIKDLEISFGEGSKKFVAVKNANFFINKGETFSLVGESGSGKTTIGRAIIGLNDTSNGDIIFDG 
QKINGKKSREQAAELIRRIQMIFQDPAASLNERATVDYIISEGLYNHRLFKDEEERKEKVQNIIREVGLLi 
AEHLTRYPHEFSGGQRQRIGIARALVMQPDFVIADEPISALDVSVRAQVLNLLKKFQKELGLTYLFIAHD 
LSWRFISDRIAVIYKGVIVEVAETEELFNNPIHPYTQALLSAVPIPDPILERKKVLKVYDPSQHDYETD 
KPSMVE I RPGH YVWANQAELARYQKGLN 

SEQ ID NO: 144 
>AnrP157536 

MTNSVFQGRSFLAEKDFTRAELEYLIGLSAHLKDLKKRNIQHHYLAGKNIALLFEKTSTRTRAAFTTAAI 
DLGAHPEYLGANDIQLGKKESTEDTAKVLGRMFDGIEFRGFSQRMVEELAEFSGVPVWN 
LADYLTVQENFGRLEGLTIiVYCGDGRNNVANSLLVTGAILGVNVHIFSPKELFPEKEIVELAEGFAKESG 
AHVLITEDADEAVKDADVLYTDVWVSMGEEDKFAERV 
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HTVYGKDVAEKFGVEEMEVTDEVFRSKYARHFDQAENRMHTIKAVMAATLGNLYIPKV 

SEQ ID NO: 145 
>AnrP555111 

MGRKWANI VAKKTAKDGANS KVYAKF GVE I YVAAKKGD PDPE SNS ALKF VI DRAKQAQVP KH 1 1 DKAI DK 
AKGNTDETFTEGRYEGFGPNGSMLIVDTLTSIWN^^^ 

GEDADAVFEQLLEADVDVDDVEAQEGTITVYTAPTDLHKAIVALRESGIEEFQVTELEMIPQSEVELSGE 
DLETFEKLYSVLEDDEDVQKIYTNVDGF 

SEQ ID NO: 146 
>AnrP166919 

MTKTIAINAGSSSLKWQLYLMPEEKVLAKGLIERIGLKDSISTVKFDGRSEQQILDIENHIQAVKILLDD 
LIRFDIIKAYDEITGVGHRWAGGEYFKESTWEGDVLEK^ 

SWVFDTSFHTSMPEKAYRYPLPTKYYTENKVRKYGAHGTSHQFVAGEAAKLLGRPLEDLKLITCHIGNG 
GS ITAVKAGKS VDTSMGFTPLGGIMMGTRTGDIDPAI I PYLMQYTEDFNTPEDI SRVLNRESGLLGVSAN 
S S DMRD I EAAVAEGNHEASLAYEMYVDRI QKH I GQYLAVLNGADAI VFT AGVGENAE S FRRDVI SGI SWF 
GCDVDDEKNVFGVTGDISTEAAKIRVLVIPTDEELVIARDVERLKK 

SEQ ID NO: 147 
>AnrP691494 

MSNLSVNAIRFLGIDAINKANSGHPGVVMGAAPMAYSLFTKQLHINPAQPNWINRDRFILSAGH 

ALLHLSGFEDVSMDEIKSFRQWGSKTPGHPEFGHTAGIDATTGPLGQGISTATGFAQAERFLAAKYNREG 

YNIFDHYTWICGDGDLMEGVSSEAASYAGLQKLDKLVVLYDSNDINLDGETKDSFTESVRDRYNAYGWH 

TALVENGTDLEAIHAAI ETAKASGKPSL I EVKTVIGYGS PNKQGTNAVHGAPLGADETASTRQALGWDYE 

PFEIPEQVYADFKEHVADRGASAYQAWTKLVADYKEAHPELAAEVEAIIDGRDPVEVTPADFPALENGFS 

QATRNSSQDAXjIWVAAKLPTFLGGSADLAHSNMTYIKTO 

HGGLRWGGTFFVFSDYVKAAVRLSALQGLPVTWFTHDSIAVGEDGPTHEPVEHLAGLRAMPNLNVFR 
ADARETQAAWYLAVTS EKTPTALVLTRQNLTVEDGTDFDKVAKGAYWYENAADFDTI L I ATGS EVNLAV 
SAAKELASQGEKIRWSMPSTDVFDKQDAAYKEEILPNAVRRRVAVEMGASQNWYKYVGLDGAVLGIDTF 

GASAPAPKVLAEYGFTVENLVKIVRNLK 

SEQ ID NO: 148 
>AnrPl05992 

MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKLLSNNL 

YDDVKKELAAAGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNGDGVGLNYHFGLFQQVLKNNQ 

QETIPNAWLTEQISIWIiVRSSRSYQVPFADFTLTSTLYDIDVTGYETATKNRLRIjFDLDSVDSSIIKDGINF 

DKTDIARNIiTLFLYPDDSDRQGELLRIFQQYFMVSNGAQLIIDEAIEKGSNLHDLADYAWQINDTHPSM 

VIPELIRLLTARGIELDEAISIVRSMTAYTNHTILAEALEKWPLEFLQEWPHLiVPIIEELDRRVKAEYK 

D PAVQ 1 1 DE SGRVHMAHMD I H YGYS VNGVAALHTE I LKNS ELKAF YDL YP EKFNNKTNGI TFRRWLMHAN 
PRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVEINPNSIFDIQ 
IKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQDIIHLILCMSEVIANDPAVA 

PHLQWMVENYNVTAASFL I PACDI S EQ I S LASKEAS GTGNMKFMLNGADTLGTMDGANVE IAEIjVGEEN 
IYIFGEDSETVIDLYAKAAYKSSEFYAREAIKPLVDFIVSDAVIiAAGNKERLERLYNEIilNKDWFMTLLD 

LEDYIKVKEQMLADYEDRDAWLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN 

SEQ ID NO: 149 
>AnrP796530 

MANRKIWALGGNAILSSDPSAKAQQEALVETAKHLVK^ 
NPAFPLDSLVAMTEGSIGFWLKNALQNALLDEGIEKNVASVVTQVVVD 

KAEAEKSGATFKEDAGRGWRKWAS PKPVDIKE I ETI RTLLNNGQVWAAGGGGI PWKENNGHLTGVEA 
VI DKDFAS QRL AELVDADLF I VLTGVDYVF VNYNKPNQEKLEHVNVAQLEE Y I KQDQF APGSMLPKVE AA 
IAFVNGRPEGKAVITSLENDGALIESESGTI IEKG 

SEQ ID NO: 150 
>AnrP693335 

MSQEKYIMAIDQGTTSSRAIIFNKKGEKVSSSQKEFTQIFPQAGWVEHNANEIWNSVQSVIAGAFIESGV 
KPNQIEAIGITNQRETTVVWDKKTGLPIYNAIWQSRQTAPLAEQLKSQGYVEKFHEKTGIillDAYFSAT 
KVRWILDHVEGAQERAEKGELLFGTIDTWLWKLTDGAAHVTDYSNAARTMLYNIKELKWDDEILEILNI 
PKAILPEVRSNSEIYGKTAPFHFYGGEVPISGMAGDQQAALFGQLAFEPGWKNTYGTGSFIIMNTGEEM 

QLSENI^LTTIGYGINGKVYYALEGSIFIAGSAIQWLRDGL 

GLGAPYWNQNARGSVFGLTRGTSKEDFIKATLQSIAYQVRDIIDTMQVOT 

QFQADILGIDIARAKNLETTALGAAFLAGLSVGYWK^ 

VKATQVFAEVDD 
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>AnrP94921 

MSNWDTKFLKKGFTFDDVLLIPAESHVIjPN^ 

LGVIHKKTMSIAQQADEVRKVKRSENGVIIDPFFL^ 

TNRDLRFISDYIJQPISNHMTSENLVTAPVGTDLAT^ 

EFPNAAKDEFGRLLVAGAVGVTSDTFERAEALFEAGADAIVIDTAHGHSAGVLRKIAEIRAHFPDRTLIA 
GNI ATAEGARALYE AGVDWKVGI GPGS I C TTRVI AGVGVPQVTAI YDAAAVARE YGKT 1 1 ADGGI KYSG 
DIVKALAAGGNAVMLGSMFAGTDEAPGETEIFQGRKFKT^ 

EGIEGRVAYKGAAADIVFQMIGGIRSGMGYCGAANLKELHDNAQFIEMSGAGLKESHPHDVQITNEAPISr^ 
SM 

SEQ ID NO: 152 
>AnrPl09912 

MTSVVWGTQWGDEGKGKITDFLSANAEVIARYQGGDNAGHTIVIDGKKFKLHLIPSGIFFPEKISVIGN 
GMVWPKSWKELSYLHEEGVTTDNLRISDRAPf^ILPYHIELDRLQEEAKGDNKIGTTIKGIGPAYMD^ 
ARVGIRIADLLDKDIFRERLERNLAEKNRLFEKLYDSBCAIVFDDIFEEYYEYGQQIKKYVIDTSVILNDA 
IjDNGKRVLF EGAQGVMLD IDQGT YP FVT S SNP VAGGVTI GS GVGP S K I DKWGVCKAYT SRVGDGP F PTE 
LFDEVGERIREVGHEYGTTTGRPRRVGWFDSVVMRHSRRVSGITNLSIiNSlDVLSGLDTVKICVAYDLDG 
QRIDYYPASLEQLKRCKPIYEELPGWSEDITGVRNLEDLPENARNYVRRVSELVGVRISTFSVGPGREQT 
NILESVWS 

SEQ ID NO: 153 
>AnrP317174 

MSFSDLKLFALSSNKELAERVAQEIGIELGKSSVRQFSDGEIQVNIEESIRGKHVFILQSTSSPVNDNDL 
E I L IMVDAIjKRAS AE S VNWMP Y YG YARQDRKARARE PITS KLVANML EVAGVDRLLT I DLHAAQ I QGFF 
DIPVDHLMGAPLIADYFERRGWGSDYVWSPDHGGVTRARKLAEFL^ 

IIGKVEGKTCILIDDMIDTAGTICHAADALAEAGAVEVYASCTHPVLSGPATDNIQKSAIKKLVVLDTIY 
LPEERLIDKIEQISIAHLLGDAIVRIHEKRPLSPLFDIEKKI 

SEQ ID NO: 154 
>AnrPl80141 

MYDYLIVGAGLFGAVFAHESALKGKKVKVIEKRNHIAGNIYTREEEGIQVHQYGAHIFHTSDKEIWDYVN 
QFAEFNRYTNSPVANYKGEIYNLPFNMNTFNKLWGWTPAEAQAKIEEQRAILNGKTPENLKEQAISLVG 
TDIYEKLIKDYTEKQWGKPTTNFHPLLFRRLPVHLTYDNNYFNDTYQGIQLGGYTQIVEKMIiDYENIDVE 
TWISLWTKEQYLEDFPKIVLTGMIDEFFDYKLAELEYRSLRFENETLDMENYQGNAVVNYTDAETPYTR 

IIEHKHFEFGSQAKTIITKEHSKTWEKGDEPYYPVNNDRNNHLYKSYKKFADEQGNVIFGGRLGHYRYYD 
MHQVIGAALQCVRNELD 

SEQ ID NO: 155 
>AnrP786510 

MTE YKNI I VTGGAGF I GSNFVH YVYENF PDVHVTVLDKLTYAGNRANI E E I LGNRVELWGD I ADAELVD 

KLAAQADAIVHYAAESHNDNSLNDPSPFIHTNFIGTYTLLEAARKYDIRFHHVSTDEVYGDLPLREDLPG 

HGEGPGEKFTAETKYNPSSPYSSTKAASDLIVKAWRSFGVKATISNCSNl^YGPYQHIEKFIPRQITNIL 

SGIKPKLYGEGKNVRDWIHTNDHSSGVWTILTKGQIGETYLIGADGEKNNKEVLELILKEMGQAADA 

VTDRAGHDLRYAIDASKLRDELGWKPEFTNFEAGLKATIKWYTDNQEWWKAEKEAVEANYAKTQEIITV 

SEQ ID NO: 15 6 
>AnrP2 82312 

MNAIQESFTDKLFANYEANVKYQAIENAASHNGIFAALERRQSHVDNTPVFSLDLTKDKVTNQKASGRCW 
MFAALNTFRHKLISQYKLENFELSQAHTFFWDKYEKSNWFLEQVIATSDQELTSRKVSFLLQTPQQDGGQ 
WDMWSLFEKYGWPKSVYPESVSSSSSRELNAILNKLLRQDAQILRDLLVSGADQATVQAKKEDLLQEI 
FNFLAMSLGLPPRKFDFAYRDKDNNYKSEKGITPQEFYKKYVNLPLEDYVSVINAPTADKPYGKSYTVEM 
LGNWGSRAVRYINVPMERLKELAIAQMQAGETWFGSDVGQLSNRKAGILATDVYDFESSMDIKLTQDK 
AGRLDYSESLMTHAMVLTGVDLDENGKSTKWKVENSWGDKVGTDGYFVASDAWMDEYTYQIWRKELLTA 
EEQAAYGAEPIVLAPWDPMGALAE 

SEQ ID NO: 157 
>AnrP392 889 
MAKLTVKDVDLKVKKVL^ 

GKSLAPVAADLAAKLGQDWFPGVTRGAKLEEAINALEDGQVLLVENTRFEDVDGKKESKNDEELGKYWA 
SLGDGI FVNDAFGTAHRAHASNVGI S ANVEKAVAGFLLENE I AYI QEAVETP ERPFVAI LGGSKVSDKI G 
VI ENLL EKADKVL I GGGMT YTF YKAQG I E I GNS LVE EDKLDVAKDLL EKSNGKIi I L PVD S KE ANAF AG YT 

EVRDTEGEAVSEGFLGLDIGPKSIAKFDEALTGAKTVVWNGPMGVFENPDFQAGTIGVMDAIVKQPGVKS 
I IGGGDSAAAAINLGRADKFSWI STGGGASMELLEGKVLPGLAALTEK 
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SEQ ID NO: 158 
>AnrP197227 

MAIVSAEKFVQAARDNGYAVGGFNTNNLEWTQAILRAA 

VESMGITVPVAIHLDHGHYEDALECIEVGYTSIMFDGSHLPVEENLKLAKEWEKAHAKGISVEAEVGTI 
GGEEDGIIGKGELAPIEDAKAMVETGIDFLAAGIGNIH 

SGIPDEQIQAAIKLGVAKVIWNTECQIAFANATRKFARDYEANEAEYDKKKLFDPRKFIjADGVKAIQASV 
EERI DVFGS EGKA 

SEQ ID NO: 159 
>AnrP262285 

WSTKTQIAGFEFDNCLMSTAAGVACMTIEELEEVKNSAAGTFVTKTATLDFRQGNPEPRYQDVPLGSINS 
MGIiPNNGLDYYLDYLLDLQEKESNRTFFLSLVGMSPEETHTILKKVQESDFRGLTELNLSCPNVPGKPQI 
AYDFETTDRILAEVFAYFTKPLGIKLPPYFDIVYFDQAAAIFNKYPLKFVNCVNSIGNGIiYXEDESWIR 
PKNGFGGIGGEYIKPTALAJWHAFYQRLNPQIQIIGTGGVLTGRDAFEHILCGASWQVGTTLHKEGVSA 

FDRITNELKAIMVEKGYESLEDFRGKLRYID 

SEQ ID NO: 160 
>AnrP274973 

MTIMSIGIIIASHGEFAAGIHQSGSMIFGEQEKVQWTFMPNEGPDDLYAKFNNAVAAFDAEDEVLVLAD 
LWSGSPFNQASRWGENPERKFAIITGLNLPMLIQAYTERLMDAAAGVEKVAANIIKEAKDGIKALPEEL 

NPVEEVASAAAAPVAQTAI PEGTVIGDGKLKINLARLDTRL.LHGQVATAWTPDSKANRI I VAS DNVAKDD 
LRKELIKQAAPGNVKAWVPIQKLIEISKDPRFGETHALILFETPQDALRAIEGGVPIKTLWGSMAHST 

GKTLWTVLSMDKEDVATFEKMRDLGVEFDVRKVPISroSKKDLFDLINKANVK 

SEQ ID NO: 161 
>AnrP178361 

MKDLTKYKGVT PAFYACYDENGEISQDRVKSLVQYFIDKGVKGI YVNGS SGECI YQSVEDRKQI IEAVME 

VAKGKLTVINHIACNNTKDSIELAKHSESVGVDAIAAIPPIYFKLPEYSIAAYWNAMSEAASNTD 

IPQLAGVALTGSLYATMRQNPRVIGVKNSSMPVQDIQMFVAAGGEDYIVFNGPDEQYLGGRLMGAEAGIG 

GTYGVMPDLFLKLESLIQERDLDTAKKLQYAINEVIYKMISGKANMYAVAKEVL 

ALAEGDIiEVAKQAAELIQQARKEFL 

SEQ ID NO: 162 
>AnrP2 60458 

MKNPF FERRC RYS I RKL S VGAC S LMI GAVL FAG PALAEETAVP ENS GANTELVS GE S EH S TNEADKQNEG 
EHARENKLEKAEGVAIASETASPASNEAATTETAEAASAAKPEEKASEWAETPSAEAKPKSDKETEAKP 

EATNQGDESKPAAEANKTEKEVQPDVPKNTEKTLKPKEIKFNSWEELLKWEPGAREDDAINRGSVVLASR 

RTGHLVNEKASKEAKVQALSNTNSKAKDHASVGGEEFKAYAFDYWQYLDSMVFWEGIjV 

GVPVYGTL FFNWSNS I ADQERF AEALKQDADG SF P I ARKLVDMAKYYGYDGYF INQETTGDLVKPLGEKM 

RQFMLYSKEYAAKVNHPIKYSWYDAMTYNYGRYHQDGLGEYNYQFMQPEGDKVPADNFFANFN^ 

YTIATANWIGRNPYDVFAGLELQQGGSYKTKVKWNDILDENGKLRLSLGLFAPDTITSLGKTGEDYHKNE 

DIFFTGYQGDPTGQKPGDKDWYGIANLVADRTPAVGNTFTTSFNTGHGKKWFVDGKVSKDSEWNYRSVS 

VLPTWRWWQTSTGEKIiRAEYDFTDAYNGGNSLKFSGDVAGKTDQDVRLYSTKLEVTEKTKLRVAHKGGKG 

SK\m^FSTTPDYKFDDADAWKELTLSDNWTNEEFDLSSLAGKTIYAVKLFFEHEGAVKDYQFNLGQIiTI 

SDNHQEPQSPTSFSWKQSLKNAQEAEAWQFKGNKDADFYEVYEKDGDSWKLLTGSSSTTIYLPKVSRS 

ASAQGTTQEIiKWAVGKNGVRSEAATTTFDWGMTVKDTSIiPKPLAENIVPGATVIDSTFPKTEGGEGIEG 

MLNGTITSLSDKWSSAQLSGSVDIRLTKPRTWRWVM^ 

AKEVRGNKAHVTDITLDKPITAQDWRLNWTSDNGTPWKAIRIYNWKMYEKLDTESW 
NNKVQVGFADVPAGATITVYDNPNSQTPLATLKSEVGGDLASAPLDLTNQSGLLYYRTQLPGKEISNVLA 
VSVPKDDRRIKSVSLETGPKKTSYAEGEDLDLiRGGVLRVQYEGGTEDELIRLTHAGVSVSGFDTHHKGEQ 
NLTLQYLGQPWANLSVTVTGQDEASPKTILGIEVSQEPKKDYLVGDSLDLSEGRFAVAYSNDTMEEHSF 

TDEGVE I SGYDAQKTGRQTLTLHYQGHEVSFDVIjVS PKAALNDEYLKQKL AEVEAAKNKWYNFAS S EVK 

EAFLKAIEAAEQVLKDHETSTQDQVNDRLNKLTEAHKALNGQEKFTEEKTELDRIiTGEVQEIiLAAK 

SGSALAPLLEKNKALVEKVDLSPEELTTAKQSLKDLVAIiLKEDKPAVFSDSKTGVEVHFSNKEKTVIKGL 

KVERVQASAEEKKYFAGEDAHVFEIEGLDEKGQDVDLSYASIVKIPIEKDKKVKKVFFLPEGKEAVELAF 

EQTDSHVIFTAPHFTHYAFVYESAEKPQPAKPAPQNTVLPKPTYQPTSDQQKAPKLEVQEEKVAFHRQEH 

ENTEMLVGEQRVI IQGRDGLIiRHVFEVDENGQRRIjRSTEVIQEAI PEIVEIGTKVKTVPAWATQEKPAQ 
NTAVKSEEASKQLPNTGTADANEALIAGLASIiGLASLALTLRRKREDKD 

SEQ ID NO: 163 
>AnrP296493 

MKKNRVFATAGLVLLiAAGVTjAAC s s SKS sds sapkaygyvytadpetldyli srknsttwtsngidglf 

TNDNYGNLAPAVAEDWEVSKDGLTYTYKIRKGVK^ 
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NSVKGLADYLSGTSTDFSTVGVKAVDDYTI-QYTLNQPEPFWNSKLTYSIFWPLNEEFETSKGSDFAKPTD 

PTSLLYNGPFLLKGLTAKSSVEFVKNEQYWDKENVHLDTINLAYYDGSDQESLERNFTSGAYSYARLYPT 
SSNYSKVAEEYKDNIYYTQSGSGIAGLGWIDRQSYNYTSKTTDSEKVATK^ 

AYSAQINGKDGAALAVRNLFVKPDFVSAGEKTFGDLVAAQLPAYGDEWKGVNLADGQDGLFNADKAKAEF 

AKAKKALEADGVQFPIHLDVPVDQASKNYISRIQSFKQSVETVLGVENVVVDIQQMTSDEFLNITYYAAN 

ASSEDWDVSGGVSWGPDYQDPSTYLDILKTTSSETTKTYLGFDNPNSPSWQVGLKEYDKLVDEAAKETS 

DLIWRYEKYAAAQAWLTDSSLFIPAMASSGAAPVLSRIVPFTGASAQTGSKGSDVYFKYLKLQDKAVTKE 
EYEKAREKWLKEKAESNEKAQKELASHVK 

SEQ ID NO: 164 
>AnrP494895 

MSQSYINVIGAGLAGSEAAYQIAERGIPVKLYEMRGVKSTPQHKTDNFAELVCSNSLRGDALTNAVGLLK 

EEMRRLGSVILESAEATRVPAGGAriAVDRDGFSQMVTEKVANHPLIEWRDEITELPTDVITVIATGPLT 

SDALAEKIHALlSroGDGFYFYDAAAPIIDVOTIDMSKVYLKSRYDKGEAAYLNAPMTKQEFMDFHEALVNA 

EEAPLNSFEKEKYFEGCMPIEVMAKRGIKTMLYGPMKPVGLEYPDDYTGPRDGEFKTPYAWQLRQDNAA 

GSLYNIVGFQTHLKWGEQKRVFQMIPGLENAEFWYGA^RNSYMDSPNLLEQTYRSKKQPIS^FFAGQM^ 

GVEGYVESAASGLVAGINAARDFKEESEVIFPETTAIGSLAHYITHADSKHFQPMNVNFGIIKELEGERI 
RDKKARYEKIAERALADLEEFLTV 

SEQ ID NO: 165 
>AnrP5715 67 

MSEKSREEEKLSFKEQILRDLEKVKGYDEVTjKEDEAVVRTPANEPSTEELMADSLSTVEEIMRKAPTVPT 
HPSQGVPASPADEIQRETPGVPSHPSQDVPSSPAEESGSRPGPGPVRPKKLEREYNETPTRVAVSYTTAE 
KKAEQAGPETPTPATETVDIIRDTSRRSRREGAKPVKPKKEKKSHVKAFVISFLVFLALLSAGGYFGYQY 
VLDSLLPIDANSKKWTVGIPEGSNVQEIGTTLEKAGLVKHGLIFSFYAKYKNYTDLKAGYYNLQKSMST 
EDLLKELQKGGTDEPQEPVLATLTIPEGYTLDQIAQAVGQLQGDFKESLTAEAFLAKVQDETFISQAVAK 
YPTLLESLPVKDSGARYRLEGYLFPATYSIKESTTIESLIDEMLAAMDKNLSPYYSTIKSKNLTVNELLT 
IASLVEKEGAKTEDRKLIAGVFYNRLNRDMPLQSNIAILYAQGKLGQNISLAEDVAIDTNIDSPYNVYKN 
VGLMPGPVDSPSLDAIESSINQTKSDNLYFVADVTEGKVYYANNQEDHDRNVAEHVNSKLN 

SEQ ID NO: 166 
>AnrP618213 

MSNEKNTNTWEKKDATWAHEIKGELTYEDKVIQKIIGLSLENVSGLLGIDGGFFSNLKEKIVNSDDVT 
SGVNVEVGKTQVAVDLNVIVEYQKNVPALYSEIREIVSSEVAKMTDLEIVEINVNVVDIKTKEQHEADSV 
SLQDRVSDVAESTGEFTSEQFEKAKSGLGSGFSTVQEKVSEGVEAVKGAANGWSHENTRVN 

SEQ ID NO: 167 
>AnrP628331 

MPQISKEALIEQIKDGIIVSCQALPHEPLYTEAGGVIPLLVKAAEQGGAVGIRANSVRDIKEIKEVTKL.P 

IIGIIKRDYPPQEPFITATMKEVDELAELDIEVIALDCTKRERYDGLEIQEFIRQVKEKYPNQLLMADTS 

IFEEGLAAVEAGIDFVGTTLSGYTSYSPKVDGPDFELIKKLCDAGVDVIAEGKIHTPEQAKQILEYGVRG 
IWGGAITRPKEITERFVASLK 

SEQ ID NO: 168 
>AnrP662295 

MFASKSERKVHYSIRKFSIGVASVWASLVMGSVVHATENEGITQVATSYNKANESQTEHRKAAKQVDED 

IKK^SEIQEYIKKMLSEIQLDKRKHTQNVNLNRKLSAIQTKYLYELRVLKEKSKKEELTSKTKKELDAA 

FEKFKKEPELTKKLAEAKQKAKAQKEEDFRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKPRNEEK 

IKQAKAKVESKKAEATRLEEIKTERKKAEEEAKRKAEESEKKAAEAKQKVDTKEQGKPKRRAKRGVSGEL 

ATPDKKENDAKSSDSSVGEETLPSPSLNMANESQTEHRKDVDEYIKKMLSEIQLDRRKHTQNVNLNIKLS 

AIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEKKVAEAEKKVEEAKKKAKDQKEEDR 

RNYPTNTYKTLELEIAESDVKVKKAELELVKEEANESRNEEKIKQAKEKVESKKAEATRLEKIKTDRKKA 

EEEAKRKAEESEKKAAEAKQKVDAEEYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKEGLRAPLQSK 

LDTKKAKLSKLEELSDKIDELDAEIAKLWQLKDAEGKnxnWEAYFKEGLEKTTAEKKAELEKAEADLKKA 

VDEPETPAPAPQPAPAPEKPAEKPAPAPAPEKPAPAPEKPAEKPAEKPAEEPAEKPAPAPEKPAPTPEKP 
APTPETPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWY 

SEQ ID NO: 169 
>AnrP72010 

NTPQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYP 
™^" AFETGLAraTAT ^ LMAIS ^ P Q L NKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLG 
HIALIDSEITKTVMELGIDDTKEEVVLLRHVIL.SHHGLLEYGSPVRPRIMEAEIIHMIDNLDASMMMMST 
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ALALVDKGEMTNKIFAMDNRSFYKPDLD 

SEQ ID NO: 170 
>AnrP7572 

MKKNRVF ATAGLVLLAAGVIjAAC SSSKSSDS S APKAYGYVYTAD PETLDYL I S S KNSTTVVTSNG I DGLiF 
TNDNYGNL APAVAEDWE VS KDGLT YT YK I RKGVKWF T S D GEE YAE VTAKDF VNGLKHAADKK S E AMYL AE 
NSVKGLADYLSGTSTDFSWGVKAVDDYTLQYTLN^ 
PTSLLYNGPFLLKGLTAKSSVEFVKNEQYWDK^^ 

SSNYSKVAEEYKDNIYYTQSGSGIAGLGVNIDRQSYNYTSKTTDSEl^ 

AYSAQINGKDGAALAVRNLFVKPDFVSAGEKTFGDLVAAQLPAYGDEWKGVNLADGQDGLFNADKAKAEF 
AKAKKALEADGVQFPIHLDVPVDQASKNYISRIQSFKQSVETVLGVENVVVDIQQM 

ASSEDWDVSGGVSWGPDYQDPSTYLDILKTTSSETTKTYLGFDNPNSPSWQVGLKEYDKLVDEAARETS 

DLlWRYEKYAAAQAWLTDSSLFIPAMASSGAAPVIiSRIVPFTGASAQTGSKGSDVYFKYLKSQDKW 

EYEKAREKWLKEKAESNEKAQKELASHVK 

SEQ ID NO: 171 
>AnrP770375 

MPITSLEIKDKTFGTRFRGFDPEEVDEFLDIWRDYEDLVRANHDKNLRIKSLEERLSYFDEIKDSLSQS 
^IAQDTAERVKQAAHERSNNIIHQAEQDAQRLLEEAKYKANEIIiRQATDNAKKVAVETEELKNKSRVFH 
QRIiKSTIESQLAIVESSDWEDILRPTATYLQTSDEAFKEWSEVLGEPIPAPIEEEPIDMTRQFSQAEMA 
ELQARI EVADKELS EFEAQ I KQEVEAPTP WS PQVEEE PLLI QLAQCMKNQK 

SEQ ID NO: 17 2 
>AnrP900265 

MKKKFALSFVALASVALLAACGEVKSGAVNTAGNSVEEKTIKIGFNFEESGSLAAYGTAEQKGAQLAVDE 

INAAGGI DGKQ I E WDKDNKS ETAEAAS VTTNLVTQ S KVS AVVGP AT SGATAAAVANATKAGVPL ISP S A 
TQDGLTKGQDYLFIGTFQDSFQGKIISNYVSEKLNAKKWLYTDNASDYAKGIAKSFRESYKGEIVADET 
FVAGDTDFQAALTKMKGKDFDAIWPGYYNEAGKIVNQARGMGIDKPIVGGDGFNGEEFVQQATAEKASN 
I YF I S GF S TTVEVS AKAKAFLDAYRAKYNEE P S TFAALAYD S VHL VANAAKGAKNSGE I KNNLAKTKDF E 
GVTGQTS FDADHNTVKTAYMMTMNNGKVE AAE WKP 

SEQ ID NO: 173 
>AnrP906899 

MSDLKKYEGVIPAFYACYDDQGEVSPERTRALVQYFIDKGVQGLYVNGSSGECIYQSVEDRKLILEEVMA 
VAKGKLTIIAHVACNNTKDSMELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDWIYN 
IPQLAGVALTPSLYTEMLKNPRVIGVKNSSMPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIG 
GTYGAMPELFLKLNQLIADKDLETARELQYAINAIIGKLTSAHGNlXtYGVIKEVIiKINEGLNIGSVRSPIiT 

PVTEEDRPWEAAAALIRETKERFL 

SEQ ID NO: 174 
>AnrP913599 

MKKNIKQYVTLGTVVVLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDNSSEVTVQPRTV 
ETTVKDPSSTAEETPVIiEKNNVTLTGGGENVTKELKDKFTSGDFTWIKYNQSSEKGLQAIiFGISNSKPG 
QQNSYVDVFLRDNGELGMEARDTSSNKNNLVSRPASWGKYKQEAVTNTVAWA^SVKKTYSLYANGTKV 
VEKKVDNFLNIKDIKGIDYYMLGGVKRAGKTAFGFNGTLENIKFFNSALDEETVKKMTTNAVTGHLIYTA 
NDTTGSNYFRIPVLYTFSNGRVFSSIDARYGGTHDFLNKINIATSYSDDNGKTWTKPKLTLAFDDFAPVP 
LEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLMFADVMPAGVSFREATRKDSGYKQIDGNYYLKLRKQ 
GDTDYNYTIRENGTVYDDRTNRPTEFSVDKNFGIKQNGNYLTVEQYSVSFENNKKTEYRNGTKVHMNIFY 
KDALFKWPTNYIAYISSNDHGESWSAPTLLPPIMGLNRNAPYLGPGRGIIESSTGRIIiIPSYTGKESAF 
IYSDDNGASWKVKWPLPSSWSAEAQFVELSPGVIQAYMRTNNGKIAYLTSKDAGTTWSAPEYDKFVSNP 
SYGTQLSIINYSQLIDGKKAVILSTPNSTNGRKHGQIWIGLINDDNTIDWRYHHDVDYSNYGYSYSTLTE 

IjPNHEIGLMFEKFDSWSRNELHMKNVVPYITFKIEDLjKKN 

SEQ ID NO: 175 
>AnrP973305 

MSQIWTKEKFISQVQGGVIVSCQALPGEALYNEEFSLMPFMAKAALEAGAVGIRANSVRDIKAIQKW 
PIIGIIKRDYPPQEPYITATMKEVDELVECGTTVIAFDAT^ 

SNLDEGLYAFKSGVDFVGTTLSGYTSTSVQSDEPDFELMKKLADFNIPVIAEGKIHYPEQLKKAYSLGVT 
S WI GGAI TRPKE IAQRF INVIK 
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>Arir P5 70195 

MKSITKKIKATLAGVAALFAVFAPSFVSAQ 

VDQELVIDGPVAPVATPAPATYAAPAAQDETVSAPVAETPWSETWSTVSGSEAEAKEWIAQKESGGSY 
TATNGRYIGRYQLTDSYLNGDYSAENQERVADAWAGRYGSWTAAKNFWLNNGWY 

SEQ ID NO: 177 
>AnrP506333 

MKKIVKYS SLAAL ALVAAGVLAAC S GGAKKEGEAASKKE 1 1 VATNGS PKP F I YEENGELTGYE I EWRAI 
FKDSDKYDVKFEKTEWSGVFAGLDADRYNMAV^^ 

DDIGGKSTEWQATTSAKQLEAYNAEHTDNPTILNYTKADLQQIMVRLSDGQFDYKIFDKIGVETVIKNQ 
GLDNLKVIELPSDQQPYVYPLLAQGQDELKSFVDKRIKELYKDGTLEKLSKQFFGDTYLPAEADIK 

SEQ ID NO: 17 8 
>AnrP4742 

MNLLIMGIjPGAGKGTQAAKIVEQFHVAHISTGDMFRAAMANQTEMGVLAKSYIDKGELV 

RliSQDDIKETGFLLDGYPRTIEQAHALDKTLAELGIELEGVINIEVNPDSLLERLSGRIIHRVTGETFHK 

VFNPPVDYKEEDYYQREDDKPETVKRRLDWIAQGEPIIAHYRAKGLVHDIEGNQDINDVFSDIEKVLTN 

LK 

SEQ ID NO: 179 
>AnrP867168 

MVKIGLFCAAGF STGMLVNNMKI AAQS SGVEAEIEAF S QSKLADYAPNIDVAIiLGPQVAYTLDKSKE I CD 
KCDVP I AVI PMMD YGML DGKKVLDL AL S L I S G 

SEQ ID NO: 18 0 
>AnrPl50728 

MRIFASPSRYIQGENALFENAKSILDLGNYPILDCDQLVYDIVGKRFEDYLHRYGFHIVLALFNGEASDN 
EINRWALAEKENCDSIIGLGGGKTIDSAKAIADLIEKPVIIAPTIASTDAPVSALSVIYTDEGAFDHYL 
FYSKNPDLVLVDTKVISQAPKRLLASGIADGLATWEARAVMQANGKTMLGQQQTLAGVAIAKKCEETLF 
ADGLQAMAACEAKWTPALENIVEANTLLSGLGFESGGLAAAHAIHNGFTALTGDIHHLTHGEKVAYGTL 
VQLLLENRPKEELDKYIEFYKKIGMPTTLKEMHDDQVGYDDLIKYGKQAT3XIEGETIHQMPFKISPSDVAQ 

AI I AVDAYVNSK 

SEQ ID NO: 181 
>AnrP2 64781 

MRKTPSHTEKKMVYS IRSLKNGTG^ 

ADSGKNESDISSPRNANASLEKTEEKPATEPTTSTSPVTTETKAEEPIEDNYFRIHVKKLPEENKDAQGL 

, WTWDDVEKPSENWPNGALSFKDAKKDDYGYYLDVKLKGEQAKKI SFLINNTAGKNLTGDKSVEKLVPKMN 
EAWLDQDYKVFSYEPQPAGTVRV^YRTDGNYDKKSLWYWGDVKNPSSAQWPDGTDFTATGKYGRYIDIP 
LNEAAREFGFIiLLDESKQGDDVKIRKENYKFTDLKNHSQIFLKDDDESIYTNPYWHDIRMTGAQHVGTS 
SIESSFSTLVGAKKEDILKHSNITNHLGNKVTITDVAIDEAGKKVTYSGDFSDTKHPYTVSYNSDQFTTK 

TSWHLKDETYSYDGKLGADLKEEGKQVDLTLWSPSADKVSVVVYDKNDPDKW 

DSTNKLGITDFTGYYYQYQIERQGKTVLALDPYAKSLAAWNSDDAKIDDAHKVAKAAFVDPAKLGPQDLT 
YGKIHNFKTREDAVIYEAHVRDFTSDPAIAKDLTKPFGTFEAFIEKLDYLKDLGVTHIQLLPVIiSYYFVN 

ELKNHERLSDYAS SNSNYNWGYDPQNYF SLTGMYS SDPKNPEKRI AEFKNL INE I HKRGMGAILDWYNH 
TAKVDIFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLVDSIKYLVDTYKVDGFRFDMMGDHDA 

ASIEEAYKAARALNPNLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTVAVFSDDIRN^ 

AF ITGGKRDVNT I FKNIj I AQ PTNF EAD S PGDVI Q Y I AAHDNLTLFD I 1 AQ S I KKD P S KAENYAE I HRRLR 
LGNLMVLTAQGTPFIHSGQEYGRTKQFRDPAYKTPVAEDKVPNKSHLLRDKDGNPFDYPYFIHDSYDSSD 
AVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDAFRLKSLQDIKDRVHLITVPGQNGVEKEDVVIG 
YQ I TAPNGD I YAVFVNADEKAREFNLGTAFAHLRNAEVLADENQAGS VG I ANPKGLEWTEKGLKLNALTA 
TVLRVSQNGTSHESTAEEKPDSTPSKPEHQDPAPEARPDSTKPDAKVADAENKPSQATADSQAEQPAQEA 
QASSVKEAVQNESVENSSKKNIPATPDRQAELPNTGIKNENKLLFAGISLBAIiLGLGFIiLKNKKEN 

SEQ ID NO: 182 
>AnrP641284 

MILQYVYWSVYMQTKTKKLIVSLSSLVLSGFLLNHYMTVGAEETTTNTIQQSQKEVQYQ 

DFGQTEDGSSPWTGSKAQGWSAWVDQKNSSADASTRVIEAKDGAITISSPEKLRAAVHRMVPIEAKKKYK 

LRFKIKTDNKVGIAKVRIIEESGKDKRLWNSATTSGTKDWQTIEADYSPTLDVDKIKLELFYETGTGTVS 

FKDIELVEVADQPSEDSQTDKQLEEKIDLPIGKKHVFSLADYTYKVENPDVASVKNGILEPLKEGTTNVI 

VSKDGKEVKKI PLKILASVKDTYTDRLDDWNGI IAGNQYYDSKNEQMAKLNQELEGKVADSLS S ISSQAD 

RI YLWEKF SNYKTS ANLTATYRKLEEMAKQVTNP SSRYYQDETVVRTVRDSMEWMHKHV^ I VGNW 

WDYEIGTPRAINNTLSLMKEYFSDEEIKKYTDVIEKFVPDPEHFRKTTDNPFKALGGNLVDMGRVKVIAG 
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LLRKDDQEISSTIRSIEQVFKLVDQGEGFYQDGSYIDHTWAYTGAYGISrVLIDGLSQLLiPVIQKTKNPID 
KDKMQTMYHWIDKSFAPLLVNGELMDMSRGRS I SRANSEGHVAAVEVLRGIHRIADMSEGETKQRLQSLV 
KT I VQ SD S YYDVF KNL»KT YKDI SLiMQ SLL SD AGVASVPRTS YLS AFNKMDKTAMYNAEKGFGFGL SLF S S 
RTLNYEHMNKENKRGWYTSDGMFYL^^ 
GTSKLDDANATATMDFTiKn^ 

NDKEASLTEQEKDYPETQSVFLESSDSKKNIGYFFFKKSSISMSKALQKGAWKDINEGQSDKEVENEFLT 
ISQAHKQNGDSYGYMLIPNVDRATFNQM 

QVLKRGVYTIRKEGDEYKIAYYNPETQESAPDQEVFKKLEQAAQPQVQNSK^ 
TGEGQS ILASLGFLLLGAFYLFRRGKNN 

SEQ ID NO: 183 
>AnrP136162 

MTNTSFSIEQFSLKGKIALITGASYGIGFAIAKSYAEAGATIVFNDINQDL^ 
CDVTDEDGIQAWKQIEQEVGVIDILVNNAGIIRRVPMCEMSAADFRKVIDIDL3XTAPFIVS 
KGHGKIINICSMMSEIiGRETVSAYAAAKGGLKMLTRNIASEYGGANIQCNGIGPGYIATPQTAPLRELQE 
DGSRHPFDQFIIAKTPAARWGNPEDLMGPAVFLASDASNFVNGHILYVDGGILAYIGKQPE 

SEQ ID NO: 184 

>AnrP97557 

MNNNFNNFNNMDDL^ 

KQGGVLAKLGRNLTAEAREGKIjD P VI GRNKE I QET S EILS RRTK3SINP VLVGD AGVGKT AVVEGL AQAIVN 
GDVPAAIKNKEIISIDISGIiEAGTQYRGSFEENVQNLWEVKEAGNIIIiFFDEIHQILGAGSTGGDSGSK 
GLADILKPALSRGELTVIGATTQDEYRNTILKNAALARRFNEVKVNAPSAENTFKILQGIRDLYQQHHNV 
ILPDEVLKAAVDYSVQYIPQRSLPDKAIDLVDVTAAHLAAQHPVTDVHAVEREIETEKDKQEKAVEAEDF 
EAALNYKTRIAELERKIENHTEDMKVT^ 

DKAVEWARAIRRNRAGFDEGNRPIGNFLFVGSTGVGKTELAKQLALDMFGTQDAIIRIjDMSEYSDRTAV 
SKLIGTTAGWGYDDNSNTLTERVRRNPYSIILLDEI^ 

VIIATSNAGFGYEANLTEDADKPELMDRLKPFFRPEFLNRFNAVIEFSHLTKEDLSKIVDLMLAEVNQTL 
AKKDIDLWSQAAKDYITEEGYDEVMGVRPLRRWEQEIRDKVTDFHLDHLDAKHLEADMEDGVLVIREK 
V 

SEQ ID NO: 185 
>AnrP261700 

MNKGLFEKRCKYSIRKFSLGVASWIGATFFGTSPVLADSVQSGSTANLPADLATALATAKENDGHDFEA 
PKVGEDQGSPEVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVEKKEQQGTVTIR 
EEKGVRYNQL S ST AQNDNAGKP AL>FEKKGLTVDANGNATVDLTF KDD S EKGKSRF GVFLKFKDTKNNVF V 
GYDKDGWFWEYKSPTTST^ 

NEKKILLKAGSYDDERTWSVKTDNQEGVKTEDTPAEKETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRV 
KEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDDAHLINAEMTVRLQWDNQLH 
FDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSGDDHIDVTNPM 
KDLAKG YMYGF VS TDKLAAGVWSNS QNS YGGGSNDWTRIiTAYKETVGNANYVG I HS S EWQWEKAYKG I VF 
PEYTKELP SAKWITEDANADKKVDWQDGAI AYRS IMNNPQGWKKVKD ITAYRIAMNFGSQAQNPFLMTIi 
DGIKKINLHTDGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASET 
YPESKYFNEKILRKNPDGSYSYGWNWLDQGINIDAAYDLAHGRLiARWEDLKKKLGDGLDFIYVDVWGNGQ 
SGDNGAWATHVL AKE INKQGWRF AX EWGHGGEY r D STFHHWAADLTYGGYTNKG INS AI TRF I RNHQKD AW 
VGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNLFAHDVMTKYFQHFTVSKWENGTPVTMTDNG 
STYKWTPEMRVELVDADJSnSTKVVVTRKSNDVNS PQYRERTVTIiNGRVI QDGSAYLTPWNWDANGKKLSTDK 
EKMYYFNTQAGATTWTLP SDWAKSK^ 

MSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVT 
VGVDNRSNAKAS I TVNTGEKEVTT YTNKS LALNYVKAYAHNTRRNNATVDDT S YFQNMYAF FTTGS DVSN 
VTLTLSREAGDEATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFWGGVEGVEDNRTHLS 
EKHDPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAF 
WGKGEFQSGRRGTQASNLEMHELPNTWTDSKKAKKATFIiVTGAETGDTWVGIYSTGNASNTRGDSGGNA 
NFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMTNYTK^ 

AEIAKIEALKNALVQKKTALVADDFASLTAPAQAQEGLANAFDGNLSSLWHTSWGGGDVGKPATMVLKEA 
TEITGLRWPRGSGSNGNLRDVKLWTDESGKEHTFTATDWPDNNKPKDIDFGKTIKAKKIVLTGTKTYG 
DGGDKYQ S AAEIjI FTRPQVAET PLDL SG YE AAL AKAQKLTDKDNQEEVAS VQ ASMKYATDNHLLTERMVE 
YFADYLNQLKDSATKPDAPTVEKPEFKLSSVASDQGKTPDYKQEIARPETPEQILPATGESQFDTALFLA 
SVSLALSALFWKTKKD 
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SEQ ID NO: 18 6 27/47 

>AnrPl75901 

MKKEHKNIF ITGS SRGIGLAI AHKF AQAGANIVLNSRGAI SEELLAEF SNYGIKWP I SGDVSDFADAKR 
MIDQAIAELGSVDVLVNNAGITQDTLML^^ 
VVGLMGNIGQAJXTYAASKAGLIGFTKSVAREVASRNIRVN^ 
FGQAEQVADLTVFLAGQDYLTGQWAIDGGLSM 

SEQ ID NO: 187 
>AnrP58038 

MTFNNKTIEELHNLLVSKEISATEL/T^ 

GI PLAVKDNI STDGILTTAASKMD YNYE P I FDATAVANAKTKGMI VVGKTNMDEFAMGGSGETSHYGATK 
NAWDHSKVPGGSSSGSAAAVASGQVRLSLGSDTGGSIRQPAAFNGIVGLKPTYGTVSRFGLIAFGSSIiDQ 
IGPFAPTVKENALLLNAIASEDAKDSTSAPVRIADFTSKIGQDIKGMKIALPKEYLGEGIDPEVKETILN 
AAKHFEKLGAIVEEVSLPHSKYGVAVYYIIASSEASSNLQRFDGIRYGYRAEDATNLDEIYVNSRSQGFG 
EEVKRRIMLGTF SL S SGY YDAYYKKAGQVRTL 1 1 QDFEKVF ADYDL ILGPTAP S VAYDLDSLNHD PVAMY 
LADLLTIPVNLAGLPGISIPAGFSQGLPVGLQLIGPKHSEETIYQVAAAFEATTGYHKQQPVIFGGDN 

SEQ ID NO: 188 
>AnrP561535 

JXONfQTVEYIKELTAIASPTGFTREIADYLVKTLEGFGYQPWTSKGGVWTIKGQNDEQHRYVTAHVDTLG 
AIVRAVKPDGRLKI^RIGGFPWNMIEGENCTIH^^ 

RLDAKVTSEKETRALGIEVGDFISFDPRTWTETGFIKSRHLDDKVSAAILLNLLRIYKEEKIELPVTTH 
FAFSVFEEVGHGANSNIPAQWEYLAVDMGAMGDDQQTDEYTVSICVKDASGPYHYDFRQHIjVALAKEQD 
IPFKLDIYPFYGSDASAAMSAGAEVKHALLGAGIESSHSYERTHIDSVIATERMV^AYLKSTLVD 

SEQ ID NO: 189 
>AnrP876509 

MSVLEIKDLHVEIEGKEILKGVNLTLKTGEIAAIMGPNGTGKSTLSAAIMGNPNYEVTKG 
ELEVDERARMGLFLAMQYPSEI PGITNAEFLRAAMNAGKEDDEKI SVREFITKLDEKMELLNMKEEMAER 
YLNEGFSGGEKKRNEILQLLMLEPTFADLD 

I T PDVVHYMMEGRVVL SGGPELAARLEREGY AKLAEELGYDYKEELi 

SEQ ID NO: 190 
>AnrP394514 

MKRIAVLTSGGDAPGMNAAIRAWRQAI SEGMEVFGI YDGYAGMVAGEIHPLDAASVGDI I SRGGTFLHS 
ARYPEFAQLEGQLKGIEQLKKHGIEGWVIGGDGSYHGAMRLTEHGFPAIGLPGTIDNDIVGTDFTIGFD 
TAVTTAMDAIDKIRDTSSSHRRTFVIEVMGRNAGDIALWAGIATGADEI 1 1 PEAGFKMEDI VAS IKAGYE 
CGKKHNI IVXjAEGVMSAAEFGQKIjKEAGDT SDIjRVTELGHIQRGGS PTARDRVLASRMGAHAVKLLiKEGI 
GGVAVGI RNEKMVENP I LGTAEEGALFS LTAEGKI WNNPHKADI EL S S LNKSL S 

SEQ ID NO: 191 
>AnrP598862 

MRKKLFLTSAAILWAVTAMNSVH^ 

KTMTPDVYSKIMNVAND S SLQL YS S AKIQKLGDKS PLEVKI ETPENI TKVTQDMYFINAAVTLGMEHAKIT 
VAAP I PVTGE S ALAGI Y Y SLEANGAKVP QANKDLAQEELKAI) SD INAENKDKSGYDANKLNVAIiAD I KSG 
LAKAKESKGNLTEEDIRKIVEDTLKNYKLDQVITGNQINIIINFALNLSKSDILSNADFTKTLOT 
VS QAGDS FKNINLNFDADKALEDGGNFL S S LWQALVNF FKS FGS 

SEQ ID NO: 192 
>AnrPl67912 

MYNYPMRI HYHRKNGE YDTC S FVKSQDQRIDLLTYKEDYFGALF SFEHP S SHVI ESLNF WHTGQTSKE Y 
SIRFNHYPLbTEVWILEGDDRIYYSENPAIASPFYKNQNPFAFDKAINSASFDHHWGYQGELiGCRVEDNQ 
AHFSLWSPTATEVQVVWESAANDAPWKTFEMKRGNSYSYNHKDNTIGWSLDVEEDLVGKTYQYQVQF 
PHHQTLTRDPYTIATSPDGKRSAILSHVEKQVENFEVKHGSEATWRIiENPCKAVICEMHIRDLTKSPTSG 
VDEHLRGTFLGAAQAGTVNQYGQSTAFDYIKKLGYNWQLQPIADRHKEYDEDGNVTYNWGYDPQNYNAP 
ETSFSTNPDDPAQVIRDLKVMVQAYHDAGIGVIMDWYNHT^ 

GVGNETASEHEMFRKYMIDSLLYWQEYNIDGFRFDLMGIHDVKTMQMIRQSLDEIDSNIILYGEGWDMG 
TGLAPYDKAKKDNAYQMPNIGFFNDNQRDAVKGGEVYGAIKSGFVSGAATEPIIiAKAlLGSRELGSYTHP 
NQVLNWEAHDNYNLHDL^ 

LTHDDRERAMNSYNAPDSWQVNWLINERQDSIEFIRQVIRLKTKTGAFSYSS 
SGCIilYEVHGKEHLLVVVNAKSEPYQFENAGNLAMLVTNSRSKEDNVIiNDISLAVLSVL 
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SEQ ID NO: 193 28/47 

>AnrP41710 

MTSTKQHKKVIWGDGAVGSSYAFALWQGIAQELGIIEIPQLHEKAVGDALDLSHALAFTSPKKIYAAQ 
YSDCADADIWVITAGAPQKPGETRLDLVGKNL^ 

FPKERVIGSGTSLDSARFRQALAEKLDVDARSVHAYIMGEHGDSEFAWSHANIAGWLEEFLKDTQNVQ 
EAELIELFEGVRDAAYTIINKKGATYYGIAVALARITKAILDDENAVLPLSVFQEGQYGVENVFIGQPAV 
VGAHGIVRPVNI PLNDAETQKMQASAKELQAI IDEAWKNPEFQEA 

SEQ ID NO: 194 
>AnrP192124 

MKKRKKLALSLIAFWLTACLVGCASWIDRGESITAVGSTALQPLVEVAADEFGTIHVGKTV1WQGGGSGT 
GJjSQVQS GAVD I GNSDVF AEEKDGI DAS ALVDHKVAVAGLAL I VNKEVDVDNLTTEQLRQ I F I GEVTNWK 
EVGGKDIiPISVINRAAGSGSRATFDTVIMEGQSAMQSQEQDSNGAVKSIVSKSPGAISYLSLTYIDDSVK 

SMKIjNGYDLSPENISSNNWPL^ 
AAGTVTVLEGRQ 

SEQ ID NO: 195 
>AnrP609662 

MYDTIIIGAGPAGMTAALYAARSNLKVALIEGGLPGGQM^ 

GVEHIYGYVENVEDHGDFKKVMTDDQTYETRTVIVATGSKHRPLGVPGEEELNSRGVSYCAVCDGAFFRD 
QDLLWGGGDSAVEEALFLTRFAKTVTIVHRRDQLRAQKVIiQDRAFANEKISFIWDSVVREIKGENRVES 
WFENVKTGQVTEQAFGGVFIWGLDPLSDFVKELNIQDQAGWIVTDNHMKTAVDGIFAVGDVRL.KDLRQ 

VTT AVGDGAI AGQEAYKF I TEHS 

SEQ ID NO: 196 
>AnrP7572 62 

MSKIVWGANHAGTACINTMLDNFGNENEIWFDQNSNISFLGCGMALWIGEQIDGAEGLFYSDKEKIiEA 
KGAKVYMNS PVL S IDYDNKWTAEVEGKEHKE S YEKL I FATGST P I L P P I EGVE I VKGNRE FKATLENVQ 
FVKL YQNAEEVINKL SDKS QHLDRI AWGGGYI GVEL AEAF ERLGKEVVTjVD I VDTVLNGYYDKDFTQMM 
AKNIiEDHNIRLALGQTVKAIEGDGKVERLITDKE 

KQETS I PGVYAVGDCATVYDNARKDTS YIALASNAVRTGIVGAYNACGHELEGIGVQGSNG I S I YGLHMV 
STGLTLEKAKAAGYNATETGFNDLQKPEFMKHDNHEVAIKIVFDKDSREILGAQMVSHDIAISMGIHMFS 
LAI QEHVT I DKLALTDLFFL PHFNKPYNYI TMAALTAEK 

SEQ ID NO: 197 
>AnrP437818 

MTRYQDDFYDAINGEWQQTAEIPADKSQTGGFWDDQEIEDLMLATTDKWLAGEEVPEDAILENFVKYHR 
nWDFDKREADGITPVLPLLKEFQEDETFADFTAKLAEFELAGKPNFLPFGVSPDFMDARINVLWASAPS 
TIIjPDTTYYAEEHPQREELLTLWKESSANLLK 

i/yhpysyedfkkfapalplddffkavigqlpdkvivdeerfwqaaeqfyseesw 
syltedirvlsgaysralsgvpeakdkvkaayhlaqepfk^^ 
ykerllkndwltpetckqaivkliwik^ 
nqpvdykewgmpahmvnayynpqkis^^ 

fdengslkdwwtesdyaafkektqwidqfdgqdsygatingkltvsewadlggiaaaleaakreadfs 
aeeffynfgriwrmkgrpefmkllasvdvhapakl^ 

SEQ ID NO; 198 
>AnrP166452 

wklvfarhgesewnkanlftgwadvdl^ 
asdqlwpvekswrlnerhyggltgknkaeaaeqfgdeq 

lddsvi pd aenlkvtlerad p fwedki apalkdgknvfvgahgns i ral vkhi kglsdde imdve i pnf p 
plvfefdeklnwse yylgk 

SEQ ID NO: 199 
>AnrP270182 

l^REGFFTGLDIGTSSVKVWAEQRNGELNV^ 

SIKSVNVGLPGNLLQVEPTQGMIPVTSDTKEITDQDVENWKSALTKSMTPDREVITFIPEEFIVDGF 
I RD PRGMMGVRLEMRGDLYTG PRT I LHNLRKTVERAGVQVENVI I S PDAMVQS VL.NEGEREFGATVIDMG 
AGQTTVATIRNQEDQFTHILQEGGDWTKDISKVIjK^ 

E AVEVTEAYL S E 1 1 SARI KH I LEQ I KQELDRRRLIjDL PGGIVL I GGNAI L PGMVELAQEVFGVRVKIjYVP 
NQVGI RNPAFAHVI SLSEFAGQLTEVNLIjAQGAI KGENDD SHQP I SFGGMLQKTAQF VQ S T PVQ PAPAPE 
VEPVAPTEPMADFQQASQNKPKLADRFRGLIGSMFDE 
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SEQ ID NO: 2 00 29/47 

>AnrP348202 

MS YFRNRD I D I ERNSMNRS VQERKCRY S I RKL SVGAVSMI VGAWFGT S PVLAQEGASEQPLANETQLSG 

ESSTLTDTEKSQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLS 

SELDKLKKLENATVHMEFKPDAKAPAFYNLFSVSSATKK^ 

APLKVKPGQWNSVTFTVEKPTAELPKGRVRL^ 

GSISTDQIRNIjTVYNRALTPEEVQKRSQIjFK^ 

LLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKT^ 

VQDPETKRIFSIYDMFPEGKGIFGMSSQKEEAYKKIDGKTYQILYREGEKGAYTIRENGTVYTPDGKATD 

YRVVVDPVKPAYSDKGDLYKGNQDLGNIYFTTNKTSPFRIAKDSYLmSYSDDDGKTWSAPQDITPMVKA 

DWMKFLGVGPGTGIVLRNGPHKGRIL.I PWTTNWSHLNGSQSSRI IYSDDHGKTWHAGEAVNDNRQVDG 

QKI H S S TMNNRRAQNT E S TVVQLNNGDVKL FMRGLT GDLQVAT S KDGGVT WEKD I KRY P QVKDVYV QMS A 

IHTMHEGKEYIILSNAGGPKRENGJVIVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHT 

EKGQNAYTLSFRKFNWDFLSKDLISPTEAKV^^ 

QYDTKTLLFTVDSEDMGQKVTGL 

PAPTVEKPEYTGPLGTSGEEPAPTVEKPEYTGPLGTAGEEAAPTVEKPEFTGGVNGTEPAVHEIAEYKGS 
DSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ 

SEQ ID NO: 201 
>AnrP260849 

MAKNVVITGATSGIGEAIARAYLEQGEDVVXiTGRRIDRLEIIjKSEFAVSFPNQTVWTFPL 
VCSDILETIGRIDILVlXnsrAGLALDLAPYQDYEELDML^ 

stagiyayagaavysatkaavktfsdglridtiatdikvttiqpgivetdfstvrfhgdkeraasvyqgi 
ealqaqdiadtwyvtsqprrvqitdmtimanqqatgfmihkk 

SEQ ID NO: 202 
>AnrP68825 

MNADDTVTIYDVAREAGVSMATVSRVVNGNKNVKENTRKKVLEVIDRLDYRPNAVARGLASKKTTTVGW 
IPNITNGYFSSLAKGIDDIAEMYKYNIVLANSDEDNEKEVSWNTLFSKQVDGIIYMGYHLTDKIRSEFS 
RSRTPIVLAGTVDVEHQLPSWIDYKQATIDAVSYLAKENERIAFVSGPLVDDINGKVRLVGYKETLKKA 
G I T YS EGLVF E S KYS YDDGYAL AERL I S SNATAAVVTGDEL AAGVLNGLiADKGVS VP EDFE I ITSDDS Q I 
SRFTRPNLTTIAQPLYDLGAISMRMLTKIMHKEEIiEEREVLLPHGLTERSSTRKRK 

SEQ ID NO: 203 
>AnrP570870 

MSSKFMKSTAVLGTVTLASLLLVACGSKTADKPADSGSSEVKELTVYVDEGYKSYIEEVAKAYEKEAGVK 
VTLKTGDALGGLDKLSLDNQSGNVPDVMMAPYDRVGSLGSDGQLSEVKLSDGAKTDDTTKSLVTAANGKV 
YGAPAVI E S LVMYYNKDLVKDAPKTF ADLENLAKD S KYAFAGEDGKTTAFLADWTNF YYT YGLLAGNGAY 
VFGQNGKDAKDIGLANDGSIAGINYAKSWYEKWPKGMQDTEGAGNLIQTQFQEGKTAAIIDGPWKAQAFK 
DAKVNYGVATI PTLPNGKEYAAFGGGKAWVI PQAVKNLEASQKFVDFLVATEQQKVIjYDKTNEI PANTEA 
RS YAEGKNDELTTAVI KQFKNTQ PL PNI S QMSAEWADWL I QRI KDKGDQK 

SEQ ID NO: 2 04 
>AnrP788451 

MS S KFMKS TAVLGTVTL AS LLLVAC GS KTADKPAD S GS S E VKELTVYVDEGYKS YI EE VAKAYEKEAGVK 

VTLKTGDAIiGGLDKLSLDNQSGNVPDVMMAPYDRVGSLGSDGQLSEVKLSDGAKTDDTTKSLVTAANGKV 

YGAPAVIESLVMYYNKDLVKDAPKTFADLENLAKDSKYAFAGEDGKTTAFLADWTNFYYTYGLIiAGNGAY 

VFGQNGKDAKDI GLANDGS I AGINYAKSWYEKWPKGMQDTEGAGNLI QTQFQEGKTAAI IDGPWKAQAFK. 

DAKVNYGVATIPTLPNGKEYAAFGGGKAWVIPQAVKNLEASQKFVDFLVAT 

RSYAEGKKDELTTAVIKQFKNTQPLPNISQMSAVWDPAKHMLFDAVSGQKDAKT 

FGE 



SEQ ID NO: 2 05 
>AnrP455508 

MKKTTIL SLTTAAVILAAYVPNEP ILADTPS S E VIKETKVGS 1 1 QQNNIKYKVLTVEGNI GTVQVGNGVT 
PVEFEAGQDGKPFTIPTKITVGDKVFTVTEVASQAFSYYPDETGRIVYYPSSITIPSSIKKIQKKGFHGS 
KAKTIIFDKGSQLEKIEDRAFDFSELEEIELPASLEYIGTSAFSFSQKLKKDTFSSSSKLELISHEAFAN 
LSNLEKLTLPKSVKTLGSNL^ 

ETKELASYSFNKNSYLKKLELNEGLEKIGTFAFADAIKLEEISLPNSLETIERLAFYGNLELKELILPDN 

VKNFGKHVMNGLPKLKSLTIGNNINSLPSFFLSGVLDSLKEIHIKNKSTEFSW 

EHIKDVLKSNLSTSNDIIVEKVDNIKQETDVAKPK^ 

LWYYLNESGSl^TGWKDKGLWYYL 

MATGWVKDKGLWYYLNESGSMATGWVTVS GKWYYT YNSGDLLVNTTT PDGYRVNANGEWVG 
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SEQ ID NO: 2 06 30/47 

>AnrP33115 

ME F S KKTREL S I KKMQERTLDLL 1 1 GGG I TGAGVALQAAAS GLETGL I EMQDF AEGTS S RS TKLVHGGLR 
YLKQFDVEWSDTVSERAWQQIAPHIPKPDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPTAN 

KVLSKDQVLERQPNLKKEGL^^ 

WARDLLTDQVFEIKARLVINTTGPWSDKVRNLSNKGTQFSQMRPTKGVHLWDSSKIKVSQPVYFDTGL 
GDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVTQEDVDYL^ 

AGNSASDYNGGNNGTlSDESFDNLIATVESYLSKEKTREDVESAVSKIiESSTSEKHIiDPSAVSRGSSLDR 

DDNGLIjTLAGGKITDYRKMAEGAMERVVDIIjKAEFDRSFKL^ 

VSRGLDSKEAHYLANLYGSNAPKVFALAHSLE 

MRDSLDSIVEPVLDEMGRFYDWTEEEKATYRM)VEAALANlSrDLAELKN 

SEQ ID NO: 2 07 
>AnrP4749 68 

MEFSKKTRELSIKKMQERTLDLLIIGGGITGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGIiR 
YLKQFDVEWSDTVSERAWQQIAPHIPKPDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPTAN 
KVLSKDQVLERQPNLKKEGIiVGGGVYLDFRNNDARLVIENIKRANQDGAL^ 
WARDLLTDQVFEIKARLVINTTGPWSDKVRN^ 

GDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVTQEDVDYLIiGIVNNRFPESNITIDDIESSWAGLRPLI 
AGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAVSKLESSTSEKHLDPSAVSRGSSLDR 
DDNGLLTLAGGKITDYRKMGDEALWSAWLTSSKQNLTVALN 

SEQ ID NO: 2 08 
>AnrP956096 

MEFSKKTRELSIKKMQERTLDLLIIGGGITGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLR 
YLKQFDVEWSDTVSERAWQQIAPHIPKSDPMLLPWDEDGATFSLFRLKVAMDLYDLLAGVSNTPAAN 

KVLSKDQVLERQPNLKKEGLVGGGWLDFRN^ 

WARDLLTDQVFEIKARLVINTTGPWSDKVRNLSNKGTQFSQMRPTKGVHLVVDSSKIKVSQPW 

GDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVTQEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLI 

AGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAVSKLESSTSEKHLDPSAVSRGSSLDR 

DDNGLLTLAGGKITDYRKX^ 

VSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRRTNHMLF 
MRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKN 

SEQ ID NO: 2 09 
>AnrP794279 

MKHLKTFYKKWFQLLVVIVISFFSGALGSFSITQLjTQKSSVNNSNNNSTITQTAYKN 
AWSVITYSANRQNSVFGNDDTDTDSQRISSEGSGVIYKKNDKEA 

PGEIVGADTFSDIAWKISSEKVTTVAEFGDSSKLTVGETAIAIGSPLGSEYANTVTQGIVSSLNRNVSL 
KSEDGQAISTKAIQTDTAINPGNSGGPLINIQGQVIGITSSKIATNGGTSVEGLGFAIPANDAINIIEQL 
EKNGKVTRPALGIQMVNLSNVSTSDIRRLNIPSNVTSGVIVRSVQSNMPANGH 
STDLQSALYNHSIGDTIKITYYRNGKEETTSIKLNKSSGDLES 

SEQ ID NO: 210 
>AnrP232621 

MKHEKQQRFSIRKYAVGAASVLIGFAFQAQTVAADGVTTTTENQPTIHTVSDSPQSSENRTEETPKAEIiQ 
PEAPKTVETETPATDKVASLPKTEEKPQEEVSSTPSDKAEWTPTSAEKETANKKAEEASPKKEEAKEVD 
SKESNTDKTDKDKPAKKDEAKAEADKPETEAGKERAATVNEKLAKKKIVSIDAGRKYFSPEQIiKEIIDKA 
KHYGYTDLHLLVGJSTOGIjRFMLDDMSITANGKT 

KDKGIGLIPTWSPGHMDAILNAMKELGIQNPNFSYFGKKSARTVDLDNEQAVAFTKALIDKYAAYFAKK 

TEIFNIGLDEYANDATDAKGWSVLQADKYYPNEGYPVKGYEKFIAYANDLARIVKSHGLKPMAFNDGIYY 

NSDTSFGSFDKDIIVSMWTGGWGGYDVASSKLLAEKGHQILNTNDAWYWLGRNADGQGWYNL 

KNTPITSVPKTEGADIPIIGGIWAAWADTPSARYSPSRIjFKLMRHFANANAEYFAADYESAEQALNEVPK 

DLNRYTAESVTAVKEAEKAIRSLDSNLSRAQQDTIDQAIAKLQETVNNLTLTPEALKEEEAKREVEKLAK 

NKVISIDAGRKYFTLNQLKRIVDKASELGYSDVHLLLGNDGLRFLLDDMTITANGKTYASDDVKKAIIEG 

TKAYYDDPNGTALTQAEVTELIEYAKSKDIGLIPAINSPGHMDAMLVAMEKLGIKNPQAHFDKVSKTTMD 

LKNEEAMNFVKALIGKYMDFFAGKTKIFNFGTO 

RGLQPMAFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYNLASPQYLASKGYKFLNTNG 

GGGFLKKAIENTGKTPFNQLASTKYPEVDLPTVGSMLSIWADRPSAEYKEEEIFELMTAFADHNKDYFRA 

NYNALREELAKIPTNLEGYSKESLEALDAAKT^ 

DENEVAANVETRPELITRTEEIPFEVIKKENPNLPAGQENIITAGVKGERTHYISVLTENGKTTETVLDS 
QVTKEVINQWEVGAPVTHKGDESGLAPTTEVKPRLDIQKEEIPFTTVTRENPLLLKGKTQVITKGVNGH 
RSNFYSVSTSADGKEVKTLVNSWAQEAVTQIVEVGTMVTHVGDENGQAAIAEEKPKLEIPSQPAPSTAP 
AEESKALPQDPAPWTEKKLPETGTHDSAGIiWAGLMSTLAAYGLTKRKED 
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31/47 

SEQ ID NO: 211 
>AnrP79161 

MANKKIRIRLKAYEHRTLDTAAAKIVESATRTGAQVAGPIPLPTERSLYTIIRATHKYKDSREQFEMRTH 
KRL I D I VNPTQKTVD ALMKLDL P S GVNVE I KL 

SEQ ID NO: 2 12 

>AnrP480781 

MANVTLFDQTGKEAGQV^ 

GRARQGSIRSPQWRGGGWFGPTPRSYGYKLPQKVRRLALKSVYSEKVAENKFVAVDALSFTAPKTAEFA 
KVLAALSIDSKVLVILEEGNEFAALSARNLPNVKVATATTASVLDIANSDKLL 

SEQ ID NO: 213 
>AnrP378449 

MNLYDVI KKP VI TE S SMAQLE AGKYVF EVDTRAHKLL I KQ AVE AAF EGVKVANVNTINVKPKAKRVGRYT 
GFTNKTKKAI ITLTADSKAIELFAAEAE 

SEQ ID NO: 214 
>AnrP271322 

MKLNEVKEFVKELRGDSQEELAKRENELKKELFELRFQAATGQLEQTARLKEVKKQIARIKTVQSEAK 
SEQ ID NO: 2 15 

>AnrPl9648 _ 

MI QTETRLKVADNS GARE I LT I KVLGGS GRKFANI GDVI VASVKQATPGGAVKKGDWKAVI VRTKSGAR 

RADGSYIKFDENAAVIIREDKTPRGTRIFGPVARELREGGFMKIVSLAPEVL 
SEQ ID NO: 216 

>AnrP635629 „ „ T ^„ 

MFVKKGDKVRVIAGKDKGTEAWLTAL^ 

LDKNGVAGRVGYKFVDGKKVRYNKKS GEVLD 

SEQ ID NO: 217 
>AnrPl22681 

MANRLKEKYIiNEWPAItTEQFNYS SVl^VPKVDKIVLNMGVGEAVSNAKSLEKAAEELAL I SGQKPL ITK 
AKKSIAGFRLREGVAIGAKVTLRGERMYEFLDKLVSVSLPRVRDFHGVPTKSFDGRGNYTLGVKEQLIFP 

EINFDDVDKTRGLDIVIVTTANTDEESRALLTGLGMPFAK 

SEQ ID NO: 218 
>AnrP311685 

MA7MTDPIADFLTRIRNANQAKHEVIjEVPASNIKKGIAEILKREGFVKNVEIIEDDKQGVIRVFIjKYGPNG 
EKVITNLKRVSKPGLRVYKKREDLPKVLNGLGIAILSTSEGLLTDKEARQKNVGGEVIAYVW 

SEQ ID NO: 2 19 
>AnrPl99123 

MLSLCLEASNRNLWRCFCVAKDDVIEVEGKVVDTMPNAMFTVELENGHQILATVSGKIRKNYIRILAGDR 
VTVEMSPYDLTRGRITYRFK 

SEQ ID NO: 22 0 
>AnrP449861 

MIEFEKPNITKIDENKDYGKFVIEPIiERGYGTTLGNSLRRVLIiASIiPGAAVTSINIDGVLHEFDTVPGVR 
EDVMQIILNIKGIAVKSYVEDEKIIELDVEGPAEVTAGDILTDSDIEIVNPDHYLFTIGEGSSLKATMTV 

NSGRGYVPADENKKDNAPVGTDAVDS I YTPVTKVXvIYQVE PARVGSNDGFDKDTLE ILTNGTI I PEDALGL 
SARILTEHDDLFTNLTEIAKSTEVMKEADTESDDRIDDRTIEELDLSVRSYNCLKRAGINTVHDLTEKSE 

AEMMKVRNLGRKSLEEVKLKL IDLGLGLKDK 

SEQ ID NO: 221 
>AnrP498702 

MKLIjKKMMQVIjIj AVF F FGLL ATNT VF ANTTGGRF VDKDNRKY YVKDDHKAI YWHKI DGKT Y YF GDI GEMV 

VGWQYLEIPGTGYRDNLFDNQPWEIGLQEKWYYFGQDGALLEQTDKQVLEAKTSENTGKVYGEQYPDSA 

EKRTYYFDNNYAVKTGWIYEDGNWYYLNKLGNFGDDSYNPLPIGEVAKGWTQDFHVTIDIDRSKPAPWYY 

LDASGKMLTDWQKVNGKWYYFGSSGSMATGWKWRGKV^YLDNKNGDMKTG 

GV^QDGLTWYYLNAGNGDMKTGWFQVNGKWYYA 
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>AnrP973350 

MKLLKKMMQVALAVFFFGLLATNW 

VGWQYLEIPGTGYRDlSnijFDNQPVNEIGLQEKWYYFGQDGALIjEQTDKQVLEAKTSEN 

EKRTYYFDNJST^AVKTGWIYEEGHWYYLNKLGNFGDDSYNPLPIGEVAKGWTQDFHVTIDIDRSKPAPWYY 

LDASGKMLTDWQKVNGKWYYFGSSGSMATGWKYVRGKWYYLDNKNGDMKTGWQ 

GWYQDGS TWYYLD P SNGDMKI GWTKVNGKWYYLNSNGAMVTGS QTIDGKVYNFAS S GEWI 

SEQ ID NO: 223 
>AnrP210688 

MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSIiSGE 

MWGWQYIPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKRED 

SEDKEEKRYYTNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDSTWYYLDPTTGI 

MQTGWQYLGNKWYYLRSSGAMATGWYQEGTTWYYLDOT 

STWYYLNAGNGDMKTGWFQWGNWY^ 

SEQ ID NO: 224 
>AnrP449261 

MSVSFENKETNRGVLTFTISQDQIKPELDRVFKSVKKSLNVPGFRKGHLPRPIFDQKFGEEALYQDAMNA 
LLPNAYEAAVKEAGLEWAQPKIDVTSMEKGQDWITAEWTKPEVKLGDYKNLEVSVDVEKEVTDADVE 
ERIERERNNLAELVIKEAAAENGDTWIDFVGSIDGVEFDGGKGENFSLGLGSGQFIPGFEDQLVGHSAG 
ETVDVIVTFPEDYQAEDLAGKEAKFVTTIHEVKAKEVPALDDELAKDIDEEVETLADLKEKYRKELAAAK 
EETYKDAVEGAAIDTAVENAEIVELPEEMIHEEVHRSVNEFLGNLQRQGINPDMYFQITGTTQEDIiHNQY 
QAEAESRTKTNLVI EAVAKAEGFDAS EEE I QKEVEQLAADYNMEVAQVQNLLjS ADMLKHDIT I KKAVELI 
TSTATVK 

SEQ ID NO: 225 
>AnrP551355 

MKKRYLVLTALLAL SLAAC SQEKTKNEDGETKTEQTAKADGTVGSKS QGAAQKKAE WNKGD Y Y S I QGKY 
DEIIVANKHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKL-YQDYWQDGKAA 
ADRYSARPGYSEHQTGLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFWRYLKGKEKETGYMAEEWHLRY 
VGKEAKEIAASGLSLEEYYGFEGGDYVD 

SEQ ID NO: 22 6 
>AnrP32375 

MKKSTVLSLTTAAVILAAYAPNEVVLADTSSSEDALNISDKEKVAENKEKHENIHSAMETSQDFKEKKTA 
VIKEKEWSKNPVIDNNTSNEEAKIKEENSNKSQGDYTDSFWKNTENPKKEDKVVY-IAEFKDKESGE^ 
IKELSSLJOtfTKVLYTYDRIFNGSAIEOT 

KS INAPFGKNFDGRGMVI SNIDTGTDYRHKAMRIDDDAKASMRFKKEDLKGTDKNYWLSDKI PHAFNYYN 
GGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKlSrFNGIDGIAPNAQIFSYKMYSDAGSGFAGDETM 
FHAI ED S I KHNVDWS VS S GFTGTGLVGEKYWQAIRALRKAG I PMWATGNYATS AS S S SWDLVANNHLK 
MTDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNIGAFFDKSKITTNEDGTKAPSKLKFVY 
IGKGQDQDLIGLDLRGKIAWDRIYTKDLKNAFKKAMDKGARAIMVWTVNYTO 

GTKSQVFSISGDDGVKLWNMINPDKKTEVKRNNKEDFKDKLEQYYPIDMESFNSNKPNVGDEKEIDFKFA 
PDTDKELYKEDIIVPAGSTSWGPRIDLLLKPDVSAPGKNIKSTLNVINGKSTYGYMSGTSMATPIVAAST 
VLIRPKLKEMLERPVLKNIjKGDDKIDL^ 

ALRNEWATFKNTDSKGLWSYGSISLKEIKGDKKYFTIKLHNTSNRPLTFKVSASAITTDSLTDRLiKLD 
ETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDLNAVINVGEAKNKNKFVESFIHFESV 
EEMEALNSNGKKINFQPSLSMPLMGFAGNWNHEPILDKWAWEEGSRSKTLGGYDDDGKPKIPGTLNKGIG 
GEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSSSGSKIANIYPLDSNGNPQDAQLERGL 
TPSPLVLRSAEEGLISIWTNKEGENQRDLKVISREHFIRGILNSKSNDAKGIKSSKLKVWGDLKWDGLI 
YNPRGREENAPESKDNQDPATKIRGQFEPIAEGQYFYKFKYRLTKDYPWQVSYIPVKIDNTAPKIVSVDF 
SNPEKIKLITKDTYHKVKDQYKNETLFARDQKEHPEKFDEIANEVWYAGAAIiWEDGEVEKNLEVTYAGE 
GQGRNRKLDKDGNTIYEIKGAGDLRGKIIEVIALDGSSNFTKIHRIKFANQADEKGMISYYLVDPDQDSS 
KYQKLGEIAESKFKNLGNGKEGSLKKDTTGVEHHHQENEESIKEKSSFTIDRNISTIRDFENKDLKKLIK 
KKFREVDDFTSETGKRMEEYDYKYDDKGNIIAYDDGTDLEYETEKLDEIKSKIYGVLSPSKDGHFEILGK 
ISNVSKNAKVYYGNNYKSIEIKATKYDFHSKTM 

RMPEKIKETKSEYPWSSYGNVIELGEGDLSKNKPDNLTKMESGKIYSDSEKQQYLLKDNIILRKGYALK 
VTTYNPGKTDMLEGNGVYSKEDIAKIQKANPN^ 

QVFTFKMNDKGEAIDKDGNLVTDSSKLVLFGKDDKEYTGEDKFJWEAIKEDGSML 

FNPSKSNKIYVRNPEFYLRGKISDKGGFNWELRWESVVDNYLIYGDLHIDNTRDFNIKLNVKDGDIM 
GMKDYKANGFPDKVTDMDGNWLQTGYSDLNAKAVGVH^ 

NINDKRNNGFDGEIQEQHIYINGKEYTSFNDIKQIIDKTLNIKIWKDFARNTTVKEFILNKDTGEVSEL 
KPHRVTVTIQNGKEMSSTIVSEEDFILPVYKGELEKGYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVF 
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KKIEEKKEEE3NTKPTFDVSKKKDNPQ 

TNNPNKL PKTGTAS GAQTLL AAG IMF I VG I FLGLKKKNQD 

SEQ ID NO: 227 
>AnrP710228 

MGKGHWNRKRVYS I RKFAVGAC SVMI GTC AVLLGGNI AGE SWYADETL I THTAEKPKEEKMI VEEKADK 
ALETKNIVERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTN 

EEVNQMI EDRKVDFNQNWYFKLNANSKEAI KPDADVSTWKKLDL P YDWS I FNDFDHE S P AQNEGGQLNGG 

EAWYRKTFKLDEKDLKKNVRLTFDGV^ 

WKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTILTPKIiEEQQHGKVETHVTSKIVNTDDKDHELVAE 

YQ I VERGGHAVTGLVRTASRTLKAHE S T S LDA I LEVERPKLWTVLNDKP ALYEL I TRVYRDGQLVDAKKD 
LFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEElSnfKAEYRRLKQMKEMGWSIRTTHNPASEQ 

TLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEK^ 
IGNEIGEANGDAHSLATVKRLVKVIKDVDKTRW^ 

ALRAKHPKmiYGSETSSATRTRGSYYRPERELKHSNGPERJSnfEQSDYGNDRVGWGKTATASWTFDRDN 
GYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVOT 

ENKELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKYAYQ 
PGTLEAIARDESGKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYI^ 

QLHGQGQLVGVDNGEQASRERYKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFT 
GKKEGQEKTVLGTEVPKVQTIIGEAPEMPTTVPFVYSDG 

EARVEVIALKSELPWKRIAPNTDLNSVDKSVSYVLIDGSVEEYEVDKWEIAEEDKAKLAIPGSRIQATG 
YLEGQPIHATLWEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLAYGAKLPEVTASAKNAAVTVLQA 
SAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSIiQVEKADSLKEDQTVKLSVRAHYQDGTQAVLP 
ADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVWVTDLHQE 
PSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTP 
IAEAPQLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTWGRLEGTQLTTKLHVRVSAQTEQGANI 

SDQWTGSELPLAFASDSNPSDPVSNVNDK^ 
LSVGFHEDHGVGVPKSYVIEYYVGKTVPTAPKNPSFV^ 

SFDKVETYAVRIRMVKADNKRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGK 
VPAVTASVSNNGLATWPSVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQ 
ALELPTKVPWFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKIiGETLSD 

NPNYDENSNQAFASATISTOIDKNSHDRVDYIjNDGDHSENRRWTISIWSPTPSSNPE 

VTQGKVQFFADSGTDAPSKLVLERWGPEFEVPTYYSNYQAYDADHPFlSnsrPENWEAVPYRADKDIAAGDE 

IWTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAE 
GQRPKVSVEElSnSTQVASTWDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEF 
VEKDLAYKTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREWEVPTDRIVLVGTKPVAQ 
EAKKPQVSEKADTKPIDSSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLWTKGKKED 

SEQ ID NO: 22 8 
>AnrP723238 

MTYL P VALT I AGTD P S GGAGIMADLKS F QARDVYGMAWTSL VAQNTRGVQL I EHVS PQMLKAQLE SVF S 
DIP PQAVKTGMLATTE IME 1 1 QPYLKKLDC PYVLDPVMVATSGDALIDSNARDYLKTNIjLPLATI ITPNL 
PEAEEIVGFSIHDPEDMQRAGRLILKEFGPQSWIKGGHLKGGAKDFLFTKNEQFVWESPRIQTCHTHGT 

GCTFAAVITAELAKGKSLiYQAVDKAKAFITKAIQDAPQLGHGSGPVNHTTFKD 

SEQ ID NO: 229 
>AnrP267094 

MNKKQWLGLGLVAVAAVGL AACGNRS SRNAAS S SDVKTKAAI VTDTGGVDDKS FNQ S AWEGLQAWGKEHN 
D S KDNGFTYFQ S T S EAX) YANNLQQAAG S YNL I FGVGFALNNAVKD AAKEHTDLNYVL I DDVI KDQKNVAS 
VT F ADNE S G YL AGVAAAKTT KTKQVGF VGG I E S E V I SRFEAGFKAGVASVDPS I KVQVD YAGS F GDAAKG 
KTIAAAQYAAGADIWQVAGGTGAGVFAEAKSLNESRPENEKVWIGVDRDQEAEGKYTSKDGKESNFVL 
VSTLKQVGTTVKDISNKAERGEFPGGQVIVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKIbDGSVKVPEK 

SEQ ID NO: 23 0 
>AnrP736063 

MKKKLLAGAITLLSVATLAACSKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQY 
GSELDDKEVDDTIAEEKKQYGENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFD 
EYTPDVTAQIIRLNNEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAA 
FALDVDGVSDVITATGTQAYSSQYYIVKLTKKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQ 

AANIKVKDQAFQNI FTQYI GGGDSS S S SSTSNE 
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>AnrP3443 5 

MKNKFFLIAILAMCIVFSACSSNSVKNEENTSKE^ 
ALGIVPVGFSKANYGVSADKGVLPWTEEKIKELNGK^^ 
YDTLSKIAPVAAYKSKPWQTIjWRDMIKIDSKALGM 
FTMINAADTSKFWIYTSKDPRANYLTDLGLVFPESLKEFE 

TLEALQKDPIjIjGKINAIKNGAVAVI pdntplaasctptpls inytieeylnllgnacknak 



SEQ ID NO: 232 
>AnrPl72568 

MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYGGLGTQKAVDN 

VNNI IAEAI IGYDVRDQQAIDRAMI ALDGTPNKGKLGANAILGVS IAVARAAADYLE I PLYS YLGGFNTK 
ALPTPMMNIINGGSHSDAPIAFQEFMILPVGAPTFKEALRYGAEIFHALKKILKSRGLETAVGDEGGFAP 
RFEGTEDGVETILAAIEAAGYVPGKDVFLGFDCASSEFYDKERKVYDYTKFEGEGAAVRTSAEQIDYLEE 

LVNKYP 1 1 T I EDGMDENDWDGWKADTERLGKKVQLVGDDFFVTNTD YL ARG I QEGAANS ILIKVNQIGTL 
TETFEAIEMAKEAGYTAWSHRSGETEDSTIADIAVATNAGQIKTGSLSRTDRIAKYNQLLRIEDQLGEV 

AEYRGLKS F YNLKK 



SEQ ID NO: 233 
>AnrP559469 

MEKYFGEKQERFSFRKLSVGWSATISSLFFMSVLASSSVDAQETAGVHYKYVADSEIiSSEEKKQLVYDI 
PT YVENDDETYYLVYKLNS QNQL AEL PNTGS KNERQALVAGASL AALG I L I FAVS KKKVKNKTVLHLVLV 
AGMGNGVLVSVHALENHLLLNYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATP 
TKQQKVDYNVTPNFVDHPSTVQAIQEQTPVSSTKPTEVQWEKPFSTELINPRKEEKQSSDSQEQLAEHK 

NLETKKEEKI S PKEKTGVNTLNPQDEVLS GQLNKPELLYREETI ETKI DFQEE I QENPDL» AEGTVRVKQE 

GKLGKKVEIVRIFSVNKEEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAI 

QPELPEAWSDKGEPEVQPTLPEAWTDKGETEVQPESPDTWSDKGEPEQVAPLPEYKGNIEQVKPETP 

VEKTKEQGPEKTEEVPVKPTEETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSENTGEVS 

SNPSDSTTSVGESNKPEHNDSKNENSEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANE 

NTGEVSNKPSDSKPPVEESNQPEKNGTATKPENSGNTTSENGQTEPEKKLELRNVSDIELYSQTNGTYRQ 

HVSLDGIPENTDTYFVKVKSSAFKDWIPVASITEEKRNGQSVYKITAKAEKLQQELENKYVDNFTFYLD 

KKAKEENTNFTSFSNLVKAINQNPSGTYHLAASLNANEVELGPDERSYIKDTFTGRIiIGEKDGKNYAIYN 

LKKPLFENLSGATVEKLSLKIWAISGKNDIGSIjANEATNGTKIKQVHVDGVLAGERGVGG 

AESS FKGRI WTYETTDAYNIGGLVGHLTGKNAS I akskatvti S SNTNRSDQTVGGLAGLVDQDAHI QN 

S YAEGD INNVKHFGKVAGVAGYLWDRT sgeekhageltjwlsdvnvtngnai tgyhytgmkvantf S s ka 

NRVFNVTDEKDEWSKESFEERGTMLDASQIVSKKAEINPLTLPTVEPLSTSGKKDSDFSKIAHYQANRA 
LWKNIEKLLPFYNKSTIVKYGNLVKENSLL^ 

EKFDLKYQTDFANIiAEYNLGNTGLLYTPNQFLYDRDSIVKEVLPELQKLDYQSDAIRKTLGISPEVKLTE 

LYLEDQFSKTKQNLGDSLKKLLSADAGLASDNSVTO 

KDLWraPDFFGKGNTSPLDTLIELGK^ 

SNNDWFKS ETKAYI VEEKSTI EEVKTKQGLAGTKYS I GVYDRITS ATWKYRNMVDPLLTLPERSVFVIST 
MSSLGFGAYDRYRSSDHKAGKALNDFVEENARETAKRQRDHYDYWYRILDEQSREKLYRTILLYDAYKFG 
DDTTSGKATAEAKFDSSNPAMKNFFGPVGNKVWNQHGAYATGDGWYMSYRMLDKDGAITYT 
DQDIYLGGYGRRNGLGPEFFAKGLLQAPDQPSDATITINSIIiKHSKSDSTEGSRLQVLDPTERFQNAADL 

QNYVHNMFDL I YMMEYLEGQS IVNKLSVYQKMAALRKI ENKYVKDPADGNEVYATNWKELTEAEARNLN 
SFESLIDHNILSAREYQSGDYERNGYYTIKLFAPIYSALSSEKGTPGDLMGRRIAYELLAAKGFKDGMVP 
YISNQYEEDAKQQGQTINLYGKERGLVTDELVLKKVFDGKYKTWAEFKTAMYQERVDQFGNLKQVTFKDP 
TKPWPSYGTKTINWDELQALMDQAVLKDAEGPRWSNYDPEIDSAWKLKRAIFKAYIiDQTNDFRSSIFE 

NKK 



SEQ ID NO: 234 
>AnrP229477 

MAKEKYDRSKPHVNIGTIGHVDHGKTTLTAAITTVLARRLPSAVNQPKDYASIDAAPEERERGITINTAH 
VEYETEKRHYAHIDAPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILL 

IDLVDDEELLELVEMEIRDLLSEYDFPGDDLPVIQGSALKALEGDSKYEDIIMELMNTVDEYIPEPERDT 
EKPLLLPVEDVFSITGRGTVASGRIDRGTVRVNDEIEIVGIKEETQKAWTGVEMFRKQLDEGIiAGDNVG 
VLLRGVQRDEIERGQVIAKPGSINPHTKFKGEVYILTKEEGGRHTPFFNNYRPQFYFRTTDVTGSIELPA 

GTEMVMPGDNVTIDVELIHPIAVEQGTTFSIREGGRTVGSGMVTEIEA 

SEQ ID NO: 235 
>AnrP9607 6 

MKKLGTLIiVLFLSAIILVACASGKKDTTSGQKLKWATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYE 
PLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHA 
WLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKY 
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FSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTD 
S I AEQGKEGD S YYSMMKYNLDKI AEGLAK 

SEQ ID NO: 23 6 
>AnrP118814 

MVTFLGNPVSFTGKQLQVGDKALDFSLTTTDLSKKSLADFDGKKKVLSWPSIDTGICSTQTRRFNEELA 
GLDNT WLTVSMDL PFAQKRWC GAEGLDNAIMLSDYFDHS FGRDYAHL INEWHLLARAVFVLDTDNTI RY 
VE YVDNINS E PNF E AAI AAAKAL 

SEQ ID NO: 237 
>AnrP470544 

MTFSFDTAAAQGAVIKVIGVGGGGGJNTAINRIWD 

GAGGQPEVGRKAAEESEETLTEAISGADMVFITAGMGGGSGTGAAPVIARIAKDLGALTVGWTRPFGFE 

GSKRGQFAVEGINQLREHVDTLLIISNNNLLEIVDKKTPIiLEALSEADNV 

DFADVKTVMANKGNALMGIGIGSGEERVVEAAR 

Q I VNQAAGQGVNI WLGT S I DE SMRDE I RVTVVATGVRQDRVEKVVAPQARS ATNYRETVKPAHS HGFDRH 
FDiytAETVELPKQNPRRLEPTQASAFGDWDLRRESIVRTTDSWSPVERFEAPISQDEDELDTPPFFKNR 

SEQ ID NO: 23 8 
>AnrP793162 

MKFRKLAC TVL AGAAVLGLAAC GNSGGS KDAAKSGGDGAKTE I TWWAF P VFTQEKTGDGVGTYEKS 1 1 E A 
FEKANPDIKVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNN 
ENIVQASKAGDKAYMYPISSAPFYMAMNKKM^ 

GQGGDQGTRAFISNLYSGSVTDEKVSKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQ 
TSYTILWAPAQNGIQAKLIiEASKVEWEVPFPSDEGKPALEYLWGFAVFNNKDDKKVAASKKFIQFIAD 
DKEWGPKDWRTGAFPVRTSFGKLYEDKRMETISGWTQYYSPYYNTIDGFAEMRTLWFPMLQSVSNGDEK 
PADALKAFTEKANETIKKAMKQ 

SEQ ID NO: 239 
>AnrP819166 

MTNLIATFQDRFSDWLTADSQHLQLSLLT^ 

GLFIPLMGIGTLPALTALVIYAIFPILQNTITGLKGIDPNLQEAGIAFGMTRWERLKKFEIPLAMPVIMS 

GIRTAAVLI IGTATL AALIGAGGLGSF ILLGIDRNNASL IL IGALS S AVLAI AFNFLLKVMEKAKLRTI F 

SGFALVALLLGLSYSPALLiVQKEKENIiVIAGKIGPEPEILANMYKLLIEENTSMTATVKPNFGKTSFM 

AliKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQDHLAYLKPMSYQNTYAVAVPKKIA 

QEYGLKTISDLKWEGQLKAGFTLEFNDREDGNKGLQSMYGrjNLNVATIEPALRYQAIQSGDIQITDAYS 

TDAELERYDLQVLEDDKQIiFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQIiNYQVGVEGKSA 

KQVAKEFLQEQGLLKK 

SEQ ID NO: 240 
>AnrP373238 

MICSDSSYSFHNKMFMIFIRRKSLMVVKVGINGFGRIGRLAFRRIQNVEGVEVTR 

YDTTQGRFDGTVEVKEGGFEVNGKFIKVSAERDPEQIDWATDGVEIVLEATGFFAKKEAAEKHLKGGAKK 
WITAPGGNDVKTWFNTNHDVLDGTEW 

LDGPHRGGDLRRARAGAANIVPNSTGAAKAIGLVIPELNGKLDGSAQRVPTPTGSVTEWAVLEKNVTVD 

EVNAAMKAASNESYGYTEDPIVSSDIVGMSYGSLFDATQTKA^ 

LEYFAKIAK 

SEQ ID NO: 241 
>AnrP377050 

MT SKVRKAVI P AAGLGTRFLPATKALAKEMLP IVDKPTI QF I VEEALKS GI ED ILWTGKSKRS I EDHFD 
SNFELEYNLKEKGKTDLLKLVDKTTDMRL 

KAVPLTKQLMDDYERTHASTIAVMPVPHDEVSAYGVIAPQGEGKDGLYSVETFVEKPAPEDAPSDLAIIG 
RYLLTPEIFEIIiEKQAPGAGNEIQLTDAIDTLNKTQRVFAREFKGARYDVGDKFGFMKTSIDYALKHPQV 
KDDLKNYLIQLGKELTEKE 

SEQ ID NO: 242 
>AnrP149458 

MKKISLLLASLCAIiFLVACSNQKQADGKLNIVTTFYPVYEFTKQVAGDTANVEIiLIGAGTEPHEYEPSAK 
AVAKIQDADTFVYENENMETWPKLLDTLDKKWKTIKATGDMIiIiliPGGEEEEGDHDHGEEGHHHEFDPH 
VWLSPVRAIKLVEHIRDSLSADYPDKKETFEKNAAAYIEKLQSLDKAYAEGLSQAKQKSFVTQHAAFNYL 
ALDYGLKQVAISGLSPDAEPSAARLAELTEYVKKNKIAYIYFEENASQALANTLSKEAGVKTDVLNPLES 
LTEEDTKAGENYISVMEKmKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTIi 
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YPFLEDGTFDQVFDYKAKLTGKMTQAEYKAYYTKGYQTDVTKINITDNTMEFVQGGQSKKYTYKYVGKKI 
LTYKKGNRGVRFLFEATDADAGQFKWQFSDHNIAPVKAEHFHIFFGGTSQETLFEEMDISMPTYYPDNIjS 
GQEIAQEMLAH 

SEQ ID NO: 243 
>AnrP354979 

MFASKSERKVHYSIRKFSIGVASVWASLFLGGWHAEEVGGKNTPTDTSSGQDISKKYADEVESHLKKI 
LSEIQTQLDRKRHTETVALINELQGIKKTYLYNL3WLKEKSELPSKIKAKLDVAFDQFKKDTLKPGEKVA 
EAQKKVAEAKKKAEDQKEEDRRIWPTNTYK^^ 

ESEKAEAIRLEEIKTDREEAKRKADAKLKEAVEKNAANSEQGEPKRRVKRGVLGEPATPDKKENDAKSSD 

SSVGEETLPSPSIiKPEKJCVAEAEKKKAEDQEEEDRRIWPTNTYKTIiELDIAESDVKVKEAELELVNEEAK 

PRNEEKIKKAKAKVESEKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQPEKPAEE 

PNPAPAPKPEKPADQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKPEQPAPAPKTGWKQENGMWYFY 
NTDGSMATGWLQNNGSWYYLNSNGAJY^ 

FQYNGSWYYLNANGDMATGWFQYNGSWYYLNAN^ 
NANGSMATDWVKDGDTWYYLEASGAMKASQW 

SEQ ID NO: 244 
>AnrP958511 

MAVISMKQLLEAGVHFGHQTRRWNPKMAKYIFTERNGIHVIDLQQTVKYADQAYDFM 

GTKKQAADAVAEEAVRSGQYFINHRWLGGTLTNWGTIQKRIARLKEIKRMEEDGTFEVLPKKEVALLNKQ 
RARLEKFLGGI EDMPRI PDVMYWD PHKEQ I AVKE AKKLG I P WAMVDTNTDPDD I DVI I PANDDAI RAV 
KL I TAKLADAI I EGRQGEDAVAVEAEF AALETQAD S I EE I VEWEGDNA 

SEQ ID NO: 245 
>AnrP72782 

MKFNPNQRYTRWSIRRLSVGVASVWASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPG 
DTVLTQAKPEGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQ 
ASPAETTPEQVKGGVKENTKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMIiTRLLLKASDNAP 
WSDNGTAKNPALPPLiEGLTKGKYFYEVT)LNGNTVGK^ 

NLVATKWDININGIiVAKETVQKAVADNVKDSIDVPAAYLEKAKGEGPFTAGVNHVIPYE 
LLLKASDKAPWSDNGDAKNPAIjSPLGENVKTKGQYFYQVALDGNVAGKEKQALIDQFRANGTQTYSATW 
VYGNKDGKPDLDNIVATKKVTININGLI SKETVQKAVADNVKDSIDVPAAYLEKAKGEGPFTAGVNHVI P 
YELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGEW 

ANGTQTYSATVNVTGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDVPAAYLEKAKGE 
GPFTAGVNHVI PYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALS PLGENVKTKGQYF YQVALDGNVA 

GKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTIKINVKETSDTANGSLSPSNSGSGV 
TPMNHNEiATGTTDSMPAIDTM 

LAGIiLGGLGLKNKKEEN 

SEQ ID NO: 246 
>AnrP40452 

MKNWKKYAFAS AS WALAAGL AAC GNLTGNSKKAAD S GDKPVI KM YQ I GDKPDNLDELLANANKIIEEKV 
GAKLDIQYLGWGDYGKKMSVITSSGENYDIAFADNYIVNAQKGAYADLTELYKKEGKDIjYKALDPAYIKG 
NTWGKIYAVPVAAJWASSQNFAFNGTLLAKYGIDISGVTSYETLEPVLKQIKEKAPDWPFAIGKVFIP 
SDNFDYPVANGLPFVIDLEGDTTKWNRYEVPRFKEHLKTLHKFYEAGYIPKDVATSDTSFDLQQDTWFV 
REETVGPAJDYGNSLLSRVANKDIQIKPITNFIK^ 

GLVYGPEGKNWEKI EGKENRVRVLDGYKGNTHMGGV^ S PA 

LGFIFNTDIWKSEISAIANTMQQFDTAINTGTVDPDK^ 

K 

SEQ ID NO: 247 
>AnrP179757 

MAEIYLAGGCFWGLEEYFSRISGVLETSVGYANGQVETTNYQLLKETDHAETVQVIYDEKEVSLREIIjLY 

yfrvidplsinqqgndrgrqyrtgiyyqdeadlpaiytwqeqermlgrkiaveveqlrhyilaedyhqd 

YLRKNPSGYCHIDVTDADKPLiIDAANYEKPSQEVLKASLSEESYRVTQEAATEAPFTNAYDQTFEEGIYV 

dittgeplffakdkfasgcgwpsfsrpiskelihyykdlshgmerievrsrsgsahlghvftdgprelgg 

LRYC INSASLRFVAKDEMEKAGYGYLLiPYLNK 

SEQ ID NO: 248 
>AnrP835378 

ILGAGFVASQPTWRAEEAEKKAVEAKQKVDAEKYALEAKIAELEYEVQGLEKELKEIDESDSEDYIKEG 
LRAPLQSKIiDAKKAKLSKIiEELSDKIDELDAEIAKLEKDVEDFKNSDGEQAEQYLVAAKKDLDAKKAELE 
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NTEADLKKAVDEPETPAPAPAPKPAPAPAPTPEAPAPAPKPETPKTGWKQENGM 

SEQ ID NO: 249 
>AnrP277775 

MFASKSERKWYSIRKFSIGVASVWASLWGSWHATENEGITQVPTSYNKANESQTEHRKAAKQVDED 
IKKMLSEIQEYIKKMLSEIQLDKRKDTQNRTLNRKLSAIQTKYLYELRVLKEKSKKEELTSKTKKELDAA 
FEKFKKEPELTKKLAEAKQKAKAQKEEDFRN^PT^ 

IKQAKAKVESKKAEATRLEEIKTERKRAEEEAKRKAGESEEKAAEANQKVDTKEQGKPKRRAKRGVSGEIi 
ATPDKKENDAKSSDSSVGEETLPSPSLNMAISrESQTEHRKDVDEYIKI^ 

AIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEKKVAEAEKKVEEAKKKAKDQKEEDR 
RNYPTNTYKTLELEIAESDVKVKEAELELVKEEAN 

EEEAKRKAEESEKKAAEAKQKVDAEEYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKEGLRAPLQSK 
LDTKKAKLSKLEELSDKIDELDVNCNLRSQLKDAEGNNIWEAYFKEG 

DEPETPAPAPQPAPAPEKPAEKQAPASSPEKPAPAPEKPGPAPEKPAPAPEKPAPTPETPKTGWKQENGM 

WYFYNTDGSMATGWLQNNGSWYYLNSNGAM^ 

ATGWLQYNGSWYYLNANGDMATGWL^ 

WYYWGSGALAWTTVDSYRWANGEWVN 

SEQ ID NO: 25 0 
>AnrP181233 

MKLLKKMMQ I AL ATFF FGLL ATNTVFADD S EGWQFVQENGRTYYKKGDLKET YWRVI DGKY YYFDPL S GE 
1WVGWQYIPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAE 
KRVYYFEDQRSYHTLKTGWIYEEGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWY 
YLNP ATGIMQTGWQYLGNRWYYLHS S GAMATGWYKEGS TWYYLDAENGDMRTGWQNLGNKWYYLRS S GAM 
ATGWYQE S S TWYYLNASNGDMKTGWF QVNGNWYYAYD SGALAVNTTVGGYYLNYNGEWVK 

SEQ ID NO: 251 
>AnrP894040 

MAREFSLEKTRNIGIMAHVDAGKTO 

QWNNHRWIIDTPGHVDFTIEVQRSLRVLDGAVTVLDSQSGVEPQTETWRQATEYGVPRIVFANKMDKI 
GADFLiYSVSTLHDRLQANAHPIQLPIGSEDDFRGIIDLIKMKAEIYTNDLiGTDIIiEEDIPAEYLDQAQEY 
REKLIEAVAETDEEIjIJMKYLEGEEITNEEL 

pldipaikginpdtdaeeirpasdeepfaalafkimtdpfvgrltffrwsgvlqsgsyvlntskgkrer 

igrilqmhansrqeidtwsgdiaaavglkdtttgdsltdekakiilesinvpepviqlmvepkskadqd 

kmgialqklaeedptfrvetnvetgetvisgmgelhldvlvdrmrrefkveanvgapqvsyretfrastq 

argffkrqsggkgqfgdwieftpneegkgfefenaivggwprefipavekglvesmangvlagypm^ 

vkaklydgsyhdvdssetafkiaaslslkeaaksaqpailepmmlvtitvpeenlgdvmghvtarrg 

gmeahgnsqivrayvplaemfgyatvlrsasqgrgtfmmvfdhyedvpksvqeeiikknkged 

SEQ ID NO: 252 
>AnrP297298 

MIEASKLKAGMTFETADGKLIRVLEASHHKPGKGNTIMRMKLRDVRTGSTFDTSYRPEEKFEQAIIETVP 
AQYLYKMDDTAYFMNTETYDQYEIPVVNVENELLYILENSDVKIQFYGTEVIGVTVPTTVELTVAETQPS 
IKGATVTGSGKPATMETGLWNVPDF I EAGQKLVINTAEGTYVSRA 

SEQ ID NO: 253 
>AnrP217378 

MNF ETVI GLEVHVELNTNS KI F S PT S AHFGNDQNANTNVI DWS F PGVL PVLNKGWDAG I KAALALNMD I 
HKKMHFDRKNYFYPDNPKAYQISQFDEPIGYNGWIEVKLEDGTTKKIGIERAHLEEDAGKNTHGTDGYSY 
VDLNRQGVPLIEIVSEADMRSPEEAYAYLTAIiKEVIQYAGlSDVKMEEGSMRVDANISLRPYGQEKFGTK 
TELKNLNSFSNVRKGLEYEVQRQAEILRSGGQIRQETRRYDEANKATILMRVKEGAADYRYFPEPDLPLF 
EISDEWIEEMRTELPEFPKERRARWSDLGLSDYDASQLTANKVTSDFFEKAVALGGDAKQVSNWLQGEV 
AQFLNAEGKTLEQI ELTPENLVEMIAI IEDGTI SSKI AKKVFVHLAKNGGGAREYVEKAGMVQI SDPAIL 
IPIIHQVFADNEAAVADFKSGKRNADKAFTGFLMKATKGQANPQVALKDLAQELAKIiKEN 

SEQ ID NO: 254 
>AnrP898188 

MKITQEEVTHVANLSKLRFSEEETAAFATTLSKIVDWELLGEVDTTGVAPTTTMAD 
TDRDRIiFKNVPEQDNYYIKVPAILDDGGDA 

SEQ ID NO: 255 
>AnrP114671 

MAQDIKNEEVEEVQEEEWKTAEETTPEKSELDLANERADEFENKYLRAHAEMQNIQRRANEERQNLQRY 
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RSQDLAKAILPSLDNLERALAV^ 
ADDEHPVDTIAQVFQKGYKLHDRILRPAMVWYN 

SEQ ID NO:256 
>AnrPl7097 

MSKI IGIDLGTTNSAVAVLEGTESKI I ANPEGNRTTPSWSFKNGEI IVGDAAKRQAVTNPDTVI S IKSK 
MGTS EKVSANGKE YT PQE I SAMI LQ YLKGYAED YLGEKVTKAVI TVP AYFNDAQRQATKDAGKI AGLEVE 
RIWEPTAAALAYGLDKTDKEEKILVFDLGGGTFDVSILELGDGVFDVIiSTAGDNKIiGGDDFDQKIIDHL 
VAEFKKENGIDLSTDKMAMQRLKDAAEKAKKDLSGVTSTQISLPFITAGEAGPLHLEMTLTRAKFDDLTR 
DLVERTKVPVRQALSDAGLSLSEIDEVILVGGSTRIPAWEAVKAETGKEPNKSWPDEWAMGAAIQGG 
VITGDVKDVVLLDVTPLSLGIETMGGV^^ 

NKTLGRFQLTDIPAAPRGIPQIEVTFDIDKNGIVSVKAKDLGTQKEQTIVIQSNSGLTDEEIDRMMKDAE 

ANAEADKKRKEEVDLRNEVTJQAIFATEKTIKETEGKGF 

LNEKAQGLAVKLYEQAAAAQQAQEGAEGAQATGNAGDDWDGEFTEK 

SEQ ID NO: 257 
>AnrP765513 

MANHFRTDRVGME I KREVNE I LQKKVRDPRVQGVT 1 1 DVQMLGDL S VAKVYYT I L SNL ASDNQKAQ I GLE 
KATGTIKRELGRNLKIiYKI PDLTFVKDES I EYGNKIDEMLRNLDKN 

SEQ ID NO: 258 
>AnrP879988 

MSKELS PKYNPAEVEAGRYQKWLDADVFKP SGDQKAKP YS I VI P P PNVTGKLHLGHAWDTTIjQDI 1 1 RQK 

RMQGFDTLWLPGMDHAGIATQAKVEERLRGEGITRYDLGRESFLTKWEWKDEYATTIKEQWGKMGLSTO 

YSRERFTLDEGLSKAVRKVFVNLYKKGWIYRGEFIINWDPAARTALSDIEVIHKDVEGAFYHMNYML 

SRALEVATTRPETMFGDVAVAWPEDPRYKDLIGKNVILPIANKIjIPIVGDEHADPELGTGWKITPAHD 

PNDFLVGQRHNLPQVNV3V[NDDGT3yDSrELAFEFSGM 

VWEPRLSTQWFVKMDQLAKNAIANQDTEDKVEFYP 

ADGEMWGEEAPEGDGWTQDEDVLDTWFSSALWPFSTMGWPEVDSEDFKRYFPTSTLVTGYDIIFFWVSR 
MIFQSLEFTGRQPFQNVLIHGLIRDEQGRKMSKSLGNGIDPMDVIEKYGADALRWFLSNGSAPGQDVRFS 
YEKMDASl^FINKIWNISRYILMNNEGLTLDVAHDWTKVATGEAGWTDRW 
FEFGVAGHILYNFIWEEFANWYVELTKEVLYSDOT 

ISEGSIVTAAYPTVNLAFEDLAAHTGVESLKDLIRAVRNARAEVNVAPSKPITILVKTSDSDLEAFFNSN 

VNYIKRFTNPEHLEIASTIPAPELAMSSVITGAEIYLPLADL^ 

SNERFVANAKPEWQKECDKQADYQAKYDATVARIDEMKKIiVK 

SEQ ID NO: 2 59 
>AnrP59901 

MAKKVEKLVKLQI PAGKATPAPPVGPALGQAGINIMGFTKEFNARTADQAGMI I PWI SVYEDKS FTFVT 

KTPPAAVLLKKAAGVEKGSGTPNKTKVATW 

D 

SEQ ID NO:260 
>AnrP800948 

MARKS KQLRAAIjEKI D S TKAYS VEEAVALAKETNFAKFDATVEVAYNLNI DVKKADQQ I RGAMVL PNGTG 
KTSRVLVFARGAKAEEAKAAGADFVGEDDLVAKINDGWLDFDW 

KTGTVTMDVAKAVE ESKGGKI T YRADRAGNVQA 1 1 GKVS F E AE KL VENF KAFNET I QKAK P ATAKGT YVT 
NLTITTTQGVGIKVDVNSL 

SEQ ID NO: 2 61 
>AnrP119923 

MANI KS AI KRAELNVKQNEKNS AQKS AMRTAIKAFEANP S EELFRAAS S AI DKAETKGL I HKNKASRDKA 
RLSAKLVK 

SEQ ID NO: 2 62 
>AnrP373768 

mEFEDLLNSVSQVETGDWSAEVLTVDATQAWAISGTGVEGVLTLRELTNDRDADINDFVW 
LVLRQWGKDTDTVTYLVSKKRLEARKAWDKLVGREEEWTVKGTRAVKGGLSVEFEGVRGFIPASMLDT 
RFVRNAERFVGQEFDTKI KEVNAKENRF ILS RREWEAATAAARAE VFGKLAVGDWTGKVARI TS FGAF 
VDLGGVDGLVHLTELSHERNVSPKSWTVGEEIEVKIIiDLNEEEGRVSLSLKATVPGPWDGVEQKLAKGD 
WEGTVKRLTDFGAFVEVLPGIDGLVK^SQISHKRI^^ 

EERPAQEEGQKEEKRAARPRRPRRQEKRDFELPETQTGFSMADLFGDIEL 
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>AnrP956241 

MLYLLFISFHKHKKGTEMQDNYTTKAKHLTIDSRRLIERWKKEGKSNREIASIiLGKAPQTIHTEIKRRTVRK 
CLGKGRFKEVYSADYAQQSYENNRKHSVKKSSLTKELKEKILHYHNQKFSPDKKQASTNFKPAGQSIEQR 
SEAINLRLENGYYEIDTVLIiTRAKNYCLL^ 

DNGTEFNRLFDVFS EEHI YYAHPYASWERGTNENHNRL IRRWLPKGTKKMTPKEVAF I EKWINNYPKKCL 
DYKS PREDF WMANLNLKF SVRNKSRN 



SEQ ID NO: 2 64 
>AnrP6905 

MANVIIEKAKERMTQSHQSLAREFGGIRAGRANASLLDRVHVEYYGVETPL 

DKSSLKDIDRALNASDLGITPANDGSVIRLVIPALTEETRRDLAKEVKKVGENAKVAVRNIRRDAMDEAK 
KQEKAKEITEDELKTLEKDIQKVTDDAVKHIDDMTANKEKELLEV 

SEQ ID NO: 2 65 
>AnrP486464 

MTLNNLQLFAHKKGGGSTSNGRDSQAKRLGAKAADGQTVTGGSILYRQRGTHIYPGVNVGRGGDDTLFAK 
VEGWRFERKGRDKKQ VS VYP IAK 

SEQ ID NO: 2 66 
>AnrP255906 

MKKDIHPEYRPWFMDTTTGYQFLSGSTKRSNETVEFEGETYPLIRVEISSDSHPFYTGRQKFTQADGRV 
DRFNKKYGLK 

SEQ ID NO: 2 67 
>AnrP897829 

MALNI ENI I AEIKEAS ILELNDLVKAI EEEFGVTAAAPVAVAAADAADAGAAKDSFDVEIiTS AGDKKVGV 
IKWREITGLGLKEAKELVDGAPALVKEGVATAEAEEIKAKLEEAGASVTLK 



SEQ ID NO: 2 68 
>AnrP798599 

MSEAIIAKKAELVDWAEKMKAAASIW\TOARGLT^^ 

EDLASVFVGPSAVAFSNEDVIAPAKILNDFSKNAEALEIKGGAIEGAVASKEEILALATLPNREGLLSML 
L S VLQAPVRNVAL AVKAVAE S KED AA 



SEQ ID NO:269 
>AnrP356001 

MKQLSSAQVRQMWLDFWATKGHSVEPSVSLW^ 

RTNDIENVGKTARHHTMFEMLGNFSIGDYFRDEAITWAYELLTSPEWFDFPAEKLYMTYYPDDKDSYNRW 
IEVGVDPSHLIPIEDNFWEIGAGPSGPDTEIFFDRGEAFDPENIGLRLLAEDIENDRYIEIV^IVBSQFN 
ADPAVPRSEYKELPHKNIDTGAGLERLVAVIQGAKTNFETDLFMPIIREVEKLSGKVYDQDGDNMSFKVI 
ADHIRSLSFAIGDGALPGNEGRGYVLRRLLRRASMHGQKLGINEPFIiYKLVPTVGKIMESYYPEVLEKRD 
FIEKIVKSEEESFARTLHSGQHFAQGIVADLKEKGQSVIAGSDVFKLYDTYGFPVELTEEIAEEAGMTVD 
REGF EAAMKEQQERARAS AVKGGSMGMQNETLQNI TVES VFNYNAS QLSS KLVAI VADNAE VGAVS EGTA 
SLIFAETSFYAEMGGQVADYGQILDESGKWATVTNVQKAPNGQALHTVEVLAPLALNQEYTLAIDSNRR 
HRVMKNHTATHIjLHAALHNILGNHAT 

TVETDIDTAKEMGAMALFGEKYGKEVRWTIGDYSIELCGGTHVDNTSEIGIiFKIVKEEGIGSGTRRILA 
VTGKEAFEAYREQEDALKAIAATLKAPQVKEVPHKVEGLQEQLRQLQKENAELKEKAAAAAAGDIFKDVK 
EVNGHRYI ASQVSVSDAGALRTFADNWKQKDYSDLLVLV^ 1 1 

DGRGGGKPDMAMAGGSNQPKIQELLDAVAGKL 

SEQ ID NO:270 
>AnrP309378 

MAEKTYPMTLEEKEKLiEKELEELKLVRRPEWERIKIARSYGDLSENSEYEAAKDEQAFVEGQISSIiETK 
I RYAE I VNSDAVAQDEVAI GKTVT I QE I GEDEEEVY 1 1 VG S AGADAF VGKVSNE S P I GQAL I GKKTGDTA 
TIETPVGSYDVKILKVEKTA 



SEQ ID NO: 271 
>AnrP2943 67 

MAKYEILYI I RPNI EEEAKNALVARFDS ILTDNGATVVESKTWEKRRLAYEI QDFREGLYHIVNVEANDD 
AALKEFDRLSKINADILRHMIVKIDA 
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SEQ ID NO: 272 40/47 

>AnrP2 60460 

MTKRVTIIDVKDYVGQEVTIGAWVANKSGKGKIAFI.QI1RDGTAFFQGVAFKPNFVEKFGEEVGLEKFDVI 
KRLSQETSVWTGIVKEDERSKFGYELDITDIEVIGESQDYPITPKEHGTDFLMDNRHLWLRSRKQVAVL 
QIRNAIIYATYEFFDKNGFMKFDSPILSGNAAEDSTELFETDYFGTPAYLSQSGQLYLEAGA]yiALGRVFD 
FGPVFRAEKSKTRRHLTEFWMMDAEYSYLTHDESLDLQEAWKALLQGVLDRAPQALETLERDTELLKRY 
IAEPFKRITYDQAIDLLQEHENDEDADYEHLEHGDDFGS PHETWI SNHFGVPTFVMNYPAAIKAF YMKPV 
PGNPERVIiCADLLAPEGYGEIIGGSMREEDYDALVAKMDEIiGMDRTEYEFYLDLRKYGTVPHGGFGIGIE 

RMVTFAAGTKH I REAI PF PRMLHRI KP 

SEQ ID NO: 273 
>AnrP3 00542 

MAISKEKKNEIIAQYARHEGDTGSVEVQVAVIiTWEINHLNEHIKQHKKDHATYRGLMKKIGRRRNLLAYL 
RKNDVNRYREL INSLGDRR 

SEQ ID NO: 274 
>AnrP433335 

MKLKDTLNLGKTEFPMRAGLPTKEPWQKEWEYAKLYQRRQELNQGKPHFTLHDGPPYANGNIHVGHAMN 
KISKDIIVRSKSMSGFYAPFIPGWDTHGLPIEQVLSKQGVKRKEMDIiVEYLKLCREYALSQVDKQREDFK 
RLGVS GDWENPYVTLTPDYE AAQ I RVFGEMANKGY I YRGAKPVYWS WS S E SALAEAE I EYHDLVS TSLYY 
ANKVKDGKGVLDTDTYIWWTTTPFTITASRGLTVGADIDYVLVQPAGEARKFWAAELIiTSLSEKFGWA 
DVQVLETYRGQELNHIVTEHPWDTAVEELVILGDH^ 

DERGimKNAGPEFEGQFYEKWPTVIEKLGNLLDAQEEISHSYPFDWRTKKPIIWRAVPQWFASVSKFR 
QEILDEIEKVKFHSEWGKVRIiYNMIRDRGDWISRQRAWGVPLPIFYAEDGTAIMVAETIEHVAQLFEEH 
GS S IWWERDAKDLLPEGFTHPGS PNGEFKKETDIMDVWFDS GS SWNGVWNRPELTYPADL YLEGSDQYR 
GWFNSSLITSVANHGVAPYKQILSQGFALDGKGEKMSKSLGNTIAPSDVEKQFGAEILRLWVTSVDSSND 
VRISMDILSQVSETYRKIRNTLRFLIANTSDFNPAQDTVAYDELRSVDKYMTIRFNQLVKTIRDAYADFE 
FLTIYKALVNFINVDLSAFYLDFAKDVVYIEGAKSLERRQMQTVFYDILVKITKLLTPILPHTAEEIWSY 
LEFETEDFVQLSELPEVQTFANQEEILDTWAAFMDFRGQAQKALEEARNAKVIGKSLEAHLTVYPNEVVK 
TLLEAVNSNVAQLLIVSEIiTIAEGPAPEAALSFEDVAFTVERATGEVCDRCRRIDPTTAERSYQAVICDH 
CAS I VEENFAEAVAEGFEEK 

SEQ ID NO: 275 
>AnrP164745 

MSKEIKFSSDARSAWRGVDILADTVKVTLGPKGRNVVLEKSFGSPLITNDGVTIAKEIELEDHFENM 

KLVS EVAS KTND I AGDGTTTATVLTQAI VREGI KNVTAGANP I G I RRGI ETAVAAAVEALKNNAI PVANK 

EAIAQVAAVSSRSEKVGEYISEAMEKVGKDGVITIEESRGMETELEWEGMQFDRGYLSQYWTDSEKW 

ADLENPYILITDKKISNIQEILPIiLESILQSNRPLLIIADDVDGEALPTLVLNKIRGTFNWAVKAPGFG 

DRRKAMLEDIAILTGGTVITEDLGLELKDATIEALGQAARVTVDKDSTVIVEGAGNPEAISHRVAVIKSQ 

IETTTSEFDREKIiQERIiAKLSGGVAVIKVGAATETELKEMKLRIEDALNATRAAVEEGIVAGGGTALANV 

IPAVATLELTGDEATGRNIVLRALEEPVRQIAHNAGFEGSIVIDRLKNAELGIGFNAATGEVmMIDQGI 

IDPVKVSRSALQNAASVASLILTTEAWANKPEPVAPAPAMDPSMMGGMM 

SEQ ID NO: 276 
>AnrP792414 

MLKPIiGDRLVLKVEEKEQTVGGFVLAGSAQEKTKTAQWATGQGVRTLNGDLVAPSVKTGDRVLVEAHAG 
LDVKDGDEKYI IVGEANILAI IEE 

SEQ ID NO: 277 
>AnrP257166 

MANKAVNDFILAMNYDKKKLL^ 
YPGALLVVDETLLENNPTLL^ 

VNNVPARMQYEKITAHSMEQLKVKFGSDFEKTGNSLDIDFNSVHSGEKQIQIVNFKQIYYTVSVDAVKNP 
GDVFQDTVTVEDLKQRGI S AERPLVYI S S VAYGRQVYLKLETTSKSDEVEAAFEAL I KGVKVAPQTEWKQ 
ILDNTEVKAVILGGDPSSGARVVTGKVDMVEDLIQEG 

VETKVTAYRNGDLLLDHSGAYVAQYYITWNELS YDHQGKEVLTPKAWDRNGQDLTAHFTTS I PLKGNVRN 
L S VKI RECTGLAWEWWRTVYEKTDL PLVRKRT I S I WGTTLYPQVEDKVEND 

SEQ ID NO: 278 
>AnrP972554 

MNTKELIASELSSIIDSLDQEAILKIjLETPKNSEMGDIAFPAFSLAKA/'ERKAPQMIAAELAEKMNSQAFE 
KWATGPYWFFLDKSAISAQVLQAVTTEKEHYADQNIGKQENWIDMSSPNIAKPFSIGHLRSTVIGDS 
LSHIFQKIGYQTVKVNHLGDWGKQFGMLIVAYKK^ 
REWFRKLENGDEEALALWQWFRDESLVEFNRLYNELKVEFDSYN^ 
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GAQWNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKNEYQFAKSIYWGQEQSAHFKQLKAVLQEMG 
YDWSDD ITHVPFGLVTKEGKKL S TRKGNVI LLE PTVAEAVSRAKVQ I EAKNP ELENKDQVAHAVG I GAI K 
FYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYA 

FPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP 
EKM 



SEQ ID NO: 279 
>AnrP659187 

MHIFDELKERGLIFQTTDEEALRKALEEGQVSYYTGYDPTADSLHLGHLVAILTSRRLQLAGHKPYALVG 
GATGLIGDPSFKDAERSLQTKDTVIDGWVKSIQGQLSRFLDFENGE 

KYFTWraMSKESVKKRIETGISYTEFAYQIMQGYDFFVLNQDHWTLQIGGSDQWGISIM 

DKTGHVITVPLITDATGKKFGKSEGNAVWL^ 

RKQFEAAPHERLAQKVLAREWTLVTiGEEAYKEAL^^ 

NNNIVELLVSSGIVNSK^ 

SEQ ID NO: 280 
>AnrP957869 

MIKYSIRGENLEVTEAIRDYWSKLEKIEKYFQPEQELDARINLKWREKTAKVEVTIPLGSITLRAEDV 
SQDMYGSIDLVTDKIERQIRKNKTKIERKNKNKVATGQLFTDALVEDSNIVQSKWRSKQIDLKPMDLEE 
AILQMDLLGHDFFIYVDVEDQTTNVIYRREDGEIGLLEVKES 

SEQ ID NO: 2 81 
>AnrP904896 

maeitaklvkelreksgagvmdakkalvetdgdi ekai ellrekgmakaakkadrvaaegltgvyvngnv 
aavievnaetdfvaknaqfvelwttakviaegkpanneealalimpsgetleaayvsatatigekisfr 
rfaliektdaqhfgayqhnggrigvisweggdealakqiismhiaamkptvlsykeiideqfvkdelaqln 
hvidqdnesramvnkpalphlkygskaq 

TKVDQAYTLLiAQVY IMDD SKTVEAYL E S VNAS WEFARFEVGEGI EKAANDF E AEVAATMAAALNN 

SEQ ID NO: 2 82 
>AnrP9 66090 

MIHFSINKNLFLQALNTTKRAISSKNAIPILSTVKIDVTNEGITLIGSNGQISIENFISQKNEDAGLLIT 
SLGSILLEASFFINWSSLPDVTLDFKEIEQNQIVLTSGKSEITLKGKDSEQYPRIQEISASTPLILETK 
LLKKIINETAFAASTQESRPILTGVHFVLSQHKELKTVATDSHRLSQKKLTIiEKNSDDFDWIPSRSLRE 
FSAVFTDDIETVEIFFANNQILFRSENISFYTRLLEGNYPDTDRLIPTDFNTTITFNVVNLRQSMERARL 
LSSATQNGTVKLEIKDGWSAHWSPEVGKVNEEIDTDQVTGEDLTISFNPTYLIDSLKALNSEKVTISF 
ISAVRPFTLVPADTDEDFMQLITPVRTN 



SEQ ID NO: 2 83 

Lipoate-protein ligase A [Streptococcus pyogenes MGAS6180] 
mkyivnkshn pafnialeay afrelveede lfilwinepa iiigkhqnti qeinkeyide 
hgihwrrls gggavyhdln nlnytiisnk taegafdfkt fsqpviatla dlgvtanftg 
rndieidgkk icgnaqayyk grmmhhgcll fdvdmtvlgd alkvskdkie skgvksvrar 
vtnilnelpe kitveefsdk iltkmketyp dmteyvlsed elakieqsak eqfgswdwty 
gkapeytier nvrypagkis tfanvensii knlkiygdff gikdvqdien lligckyeyr 
dvfeclktid ttqyf srmtv eevakaivs 



SEQ ID NO: 2 84 

Hypothetical protein gbs0899 [Streptococcus agalactiae NEM316] 
mkyivntsnd paynvaleay afqkltdide ifilwinepa iiigrhqnti qeinkefidk 
ngihwrrls gggavyhdln nlnytiisnn tqegafdfqt fskpvidtla klgvkaeftg 
rndleingqk fagnaqayyk grmmhhgcll fdvdmsvlgq alkvskdkie skgiksvrar 
vtnivdhlsd kitvqefsda ilaqmkeeyp emdeyvlsda elseiqamrd nqfatwdwty 
gkapeytier gvrypagkit tyanvensti ksvkifgdff gvkpvddiek mlegvrydyk 
dvlaalktvd tsqyfsrmtp eeitkaivd 



SEQ ID NO: 2 85 

Hypothetical protein lmo0931 [Listeria monocytogenes EGD-e] 
myfidnnnek dprinlavee filtelnlde pvllfyinkp siiigrnqnt veeidteyve 
kndviwrrl sgggavyhde gnlnfsfite ddgesfhnfa kftqpiveal krlgvnaelk 
grndllidgf kvsgnaqfat kgkmfshgtl mydlnldnva aslkprkdki eskgiksvrs 
rvanisdfmd qemtteefrd llllyifgve kvedvkeykl taadwekihe isakrygnwd 
wnygkspkfd ltrtkrfpvg avdvrlnvqk gvitdikifg dffgvknvad ieeklvntty 
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krevlaealv didvkeyfgn itkdefldll y 
SEQ ID NO: 286 

ATP-dependent Clp protease proteolytic subunit [Streptococcus 
pyogenes MGAS618 0] 

mipwieqts rgersydiys rllkdriiml tgpvednman sviaqllfld aqdntkdiyl 
yvntpggsvs aglaivdtmn fikadvqtiv mgmaasmgtv iassgtkgkr fmlpnaeymi 
hqpmggtggg tqqtdmaiaa ehllktrhrl ekilaqnagk tikqihkdae rdywmsaeet 
Itygfideim ennelk 

SEQ ID NO: 287 

Streptococcus agalactiae clpP gene for ClpP serine protease 
mipwieqts rgersydiys rllkdriiml tgqvednman siiaqllfld aqdntkdiyl 
yvntpggsvs aglaivdtmn fiksdvqtiv mgmaasmgti iassgakgkr fmlpnaeymi 
hqpmggtggg tqqsdmaiaa ehllktrhtl ekiladnsgq siekvhddae rdrwmsaqet 
ldygfidaim ennnlq 

SEQ ID NO: 2 88 

ATP-dependent Clp protease proteolytic subunit [Listeria monocytogenes] 
mnliptvieq tsrgeraydi ysrllkdrii mlgsaiddnv ansivsqllf ldaqdpekdi 
flyinspggs isagmaiydt mnfvkadvqt igmgmaasmg sflltagang krfalpnaei 
mihqplggaq gqateieiaa rhilkikerm ntimaektgq pyeviardtd rdnfmtaqea 
kdygliddii inksglkg 

SEQ ID NO: 289: DLSVLA 

SEQ ID NO: 290: IINELPK 

SEQ ID NO: 291: IDG 

SEQ ID NO: 292: SKDKFESKGVKSVRA 

SEQ ID NO: 293: VEKFRDLLLE 

SEQ ID NO: 294: KKEYP 



SEQ ID NO: 295 : 

>AnrP51602 9 fused to E. coli heat-stable enterotoxin ST-Ia 
lyrZYLPSFSQKAESVDSSKEKITLDTKK^ 

SIIVGRHQNTIEEINRDYVRENGIEVVRRISGGGAVYHDLNNLNYTIISKEDENKA 

FTGRNDLEIDGKKFCGNAQAYINGRIMHHGCLLFDVDL^ 

KFRDLLLEYMKKEYPEMTEYVFSEEELAEINRIKDTO 

KIYGDFFGIEDVAAVEDVLRGVKYEREDVTjKAIjKTIDITRYFAGISREEIAEAVVG 
SEQ ID NO: 296: 

>AnrP516029 fused to E.coli heat-stable enterotoxin EASTl 

MPSTQYIRRPASSYASCIWCATACASCHGRTTKPSIiATKYIINHSNDTAFNIALEEYAFKHLLDEDQIFLLWINKPSI 
IVGRHQNTIEEINRDWRENGIEWRRISGGGAWHDLNNLNYTIISKEDENKAFDFKSFSTPVINTLAQLGVKAEFT 
GRNDLEIDGKKFCGNAQAYINGRIMHHGCLLFDVDLSVLANALKVSKDKFESKGVKSVRARVTNIINELPKKITVEKF 
RDLLLEYMKKEYPEMTEWFSEEELAEINRIKDTKFG™^ 

YGDFFGI EDVAAVED VXiRGVKYEREDVLKALKT I DITRYFAGI SREE I AEAWG 



WO 2006/084467 



PCT/DK2006/000073 



43/47 



Fig. 3 
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Fig. 4 
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Fig. 5 
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Fig. 6 
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